11 dependents
Package Description Downloads/month
Convert PDF to markdown + JSON quickly with high accuracy 566K
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/J... 282K
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with Duck... 2K
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, Open... 1K
PDF processing pipeline: remove headers/footers, convert to markdown, and genera... 410
Translate PDFs into any language via the Google Translate unofficial API. 333
經緯・Contexture 经纬万卷,结构古今・Weaving Data from History 经纬古今|用 AI 重塑人文学术的知识基础设施 208
Convert PDF to markdown + JSON quickly with high accuracy 93
A separately packaged Marker fork published as marker-vN for converting document... 91
Intelligent document processing. Extract structured data like JSON, Markdown and... 71
A high-performance, open-source PDF data extraction tool. 一站式开源高性能数据提取工具,将复杂 P... 71