Chunkr is a Y Combinator-backed Document Intelligence API platform specializing in parsing and extracting data from complex documents, transforming PDFs, images, and spreadsheets into LLM-ready formats using advanced OCR and layout analysis technology. The platform converts unstructured documents into structured, machine-readable data with capabilities including PDF parsing, image OCR, spreadsheet processing, layout detection, and table extraction with schema-based extraction supporting multiple output formats (HTML, Markdown, JSON). Chunkr handles handwritten text, forms, mathematical formulas, and technical diagrams while supporting approximately 100 languages for multilingual processing. The platform maintains document structure and reading order, and is SOC2 and HIPAA compliant with customizable data retention policies.
Top companies in RAG Frameworks you can use instead of Chunkr.
Companies from adjacent layers in the AI stack that work well with Chunkr.