Datalab to Chandra: OCR 2 for document intelligence
Summary
Datalab's Chandra OCR 2 is a high-performance OCR model that preserves layout and outputs structured HTML/Markdown/JSON. It supports 90+ languages, forms, and handwriting, with local (HuggingFace) or remote (vLLM) inference, plus a hosted API and benchmark data.