PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
Layout-Parser
layoutparser

A Unified Toolkit for Deep Learning Based Document Image Analysis

155K 6K 533
nanonets
llm-data-converter

Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract

900 7 1
DCC-BS
docling-pp-doc-layout

A Docling plugin for PaddlePaddle PP-DocLayout-V3 model document layout detection.

371 4 0
nanonets
document-data-extractor

Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.

254 7 1
Data from PyPI, GitHub, ClickHouse, and BigQuery