PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter them by metrics
kreuzberg-dev
html-to-markdown

High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

487K 694 55
stefan6419846
hocr-tools-lib

Advanced tools for hOCR integration (library version)

478 4 1
BlueBox-WorldWide
textract-hocr

Convert AWS Textract JSON output to hOCR format for use with document processing tools.

276 0 0
brunomacabeusbr
pyslibtesseract

✏️ Integration of Tesseract for Python using a shared library

212 12 2
Data from PyPI, GitHub, ClickHouse, and BigQuery