Chunkr

W24 · Pivot 1 of 2

3 people|Active|Website

87° Major Pivot

Before

AI Search Engine for Research

After

Open source API service to parse complex documents

Full description — before

AI Search Engine + API. We've built search that returns 5x more relevant results compared to Google Scholar. Our search engine is free to use, and we offer an API for LLM-focused applications. Database: over 100M research objects, covering 16 source types, ~12.5K journals & repositories, and ~65K concepts.

Full description — after

Battle-tested + highly modular vision infrastructure to convert PDFs, PPTs, Word, Excel, PNGs, and JPEGs into LLM-ready data. We started by building lumina.sh, where we needed to parse ~600M pages of scientific literature. The researchers didn't care, but devs wanted our ingestion pipeline. So we built chunkr instead. We offer high-quality layout analysis, OCR, bounding boxes, granular VLM controls, semantic chunking, and all the last-mile engineering that goes into building standout AI applications. Common use cases include RAG and automating document workflows like invoices/medical reports -> database.

Category shift

AI Investment & Research -> AI Legal Automation

Summary

The company shifted from building an AI search engine for research (an end-user search product backed by its own database) to providing developer infrastructure and APIs for document parsing and vision pipelines. The two are fundamentally different products with different user bases.

Detected 1 year ago · 2025-02-13

Company journey — 2 pivots

Current

Post-training data to teach models document work

105.7° Near Reinvention · 2026-03-20

Open source API service to parse complex documents (viewing)

87.4° Major Pivot · 2025-02-13

Started as

AI Search Engine for Research