PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
reclamador
document-clipper

A set of utility classes and functions to process documents with Python

2K 4 3
psjinx
html2latex

Convert WYSIWYG HTML to LaTeX with typed ASTs, full table support, and 100% test coverage

1K 17 13
nanonets
llm-data-converter

Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract

900 7 1
herrkaefer
anything2md

Python package and CLI for converting documents to Markdown using Cloudflare Workers AI toMarkdown.

321 1 0
nanonets
document-data-extractor

Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.

254 7 1
scivision
loutils

Cross-platform LibreOffice document conversion and printing

162 29 0
gonzalopezgil
docx2md-cli

High-fidelity Word (.docx) to Markdown converter. Preserves tables (vMerge), footnotes, field codes, bibliography, bold/italic/underline, and numbered lists.

140 0 0
faizkhairi
file2md

Convert PDF and DOCX to clean, grep-friendly Markdown for AI/IDE workflows

125 0 0
YashKasare21
docstream

Professional document conversion library (PDF ↔ LaTeX)

109 2 1
markdownbridge
markdownbridge

Python SDK for the MarkdownBridge OCR API — convert documents and images to Markdown

93 0 0
marimo-marine23
xlmelt

Convert complex Excel files into AI-readable JSON/HTML

77 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery