18 dependents
Package Description Downloads/month
Convert PDF to markdown + JSON quickly with high accuracy 566K
Detect and extract tables to markdown and csv 10K
Docling plugin for Surya OCR 8K
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx... 839
Medical cOmputational Suite for Advanced Intelligent eXtraction 811
Docket Analyzer OCR Utility 462
A sleek, open-source Python library for effortless Retrieval-Augmented Generatio... 420
PDF processing pipeline: remove headers/footers, convert to markdown, and genera... 410
A powerful tool to extract text, tables, charts, and formulas from documents and... 329
soiz is awesome 🤘 231
Ready-to-use Python package designed to extract clean, structured text from scie... 226
經緯・Contexture 经纬万卷,结构古今・Weaving Data from History 经纬古今|用 AI 重塑人文学术的知识基础设施 208
Translate PDF documents using OCR and machine translation 206
Specifind: A Natural Language Processing Tool for Automating Species Occurrence ... 102
Convert PDF to markdown + JSON quickly with high accuracy 93
A separately packaged Marker fork published as marker-vN for converting document... 91
PDF를 Markdown으로 변환하여 저장하는 라이브러리 5
1