opendatalab/MinerU
Summary
opendatalab/MinerU is an open-source document parsing platform that converts PDFs, images, and Office formats (DOCX, PPTX, XLSX) into machine-readable Markdown/JSON for downstream retrieval and processing. It features a dual-engine approach (VLM with OCR), supports 109 languages, and offers multiple deployment options (CLI, API, FastAPI, Gradio UI, Docker) with CPU and GPU acceleration. The project emphasizes integration with AI workflows (MCP Server, LangChain, RAG frameworks) and provides extensive release notes and licensing information.