OpenDataLoader PDF: PDF Parser for AI-ready data with bounding boxes and accessibility
Summary
OpenDataLoader PDF is an open-source PDF parser designed to produce AI-ready outputs with bounding boxes and structured JSON. It offers deterministic local processing, a hybrid AI-assisted mode for complex pages, OCR for scans, and plans to auto-tag PDFs to Tagged PDFs for accessibility, with LangChain integration and multiple language support.