PyPI Stats

PDF Converter Python Packages

Python packages with the GitHub topic pdf-converter. Sorted by relevance, with stars and monthly downloads.
docling-project
docling

Get your documents ready for gen AI

6M 59K 4K
xhtml2pdf
xhtml2pdf

A library for converting HTML into PDFs using ReportLab

3.5M 2K 655
borb-pdf
borb

borb is a library for reading, creating and manipulating PDF files in Python.

539K 4K 158
docling-project
docling-slim

Get your documents ready for gen AI

294K 59K 4K
opendatalab
mineru

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

283K 62K 5K
opendataloader-project
opendataloader-pdf

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

112K 20K 2K
opendatalab
magic-pdf

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

76K 62K 5K
abarker
pdfcropmargins

pdfCropMargins -- a program to crop the margins of PDF files

41K 469 41
miikanissi
zebrafy

Python library for converting PDF and images to and from Zebra Programming Language (ZPL).

21K 76 11
aspose-pdf
aspose-pdf

Aspose.PDF for Python via .NET examples and showcase projects

18K 7 0
raphaelmansuy
edgeparse

High-performance PDF-to-structured-data extraction — Rust engine, Python interface

9K 106 13
DS4SD
deepsearch-toolkit

Interact with the Deep Search platform for new knowledge explorations and discoveries

8K 227 32
explosion
spacy-layout

📚 Process PDFs, Word documents and more with spaCy

5K 894 64
opendatalab
mineru-selfhosted-mcp

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

4K 62K 5K
opendataloader-project
langchain-opendataloader-pdf

A LangChain integration for OpenDataLoader PDF

4K 32 3
pankajr141
pdf2jpg

Utility to convert PDF into JPG files

3K 58 22
gastongouron
ironpress

Pure Rust PDF converter, no browser, no external dependencies. Supports HTML with inline CSS, Markdown, and document conversion with a built-in layout engine.

3K 161 9
benjamin-awd
monopoly-core

Monopoly is a Python library & CLI that converts bank statement PDFs to CSV.

2K 128 41
Hugues-DTANKOUO
olgadoc

Python bindings for Olga. PDF, DOCX, XLSX, HTML → Markdown and typed JSON, 15–40× faster than equivalent-quality OSS. Strictly-typed surface, no Any, one abi3 wheel for CPython 3.8+.

2K 6 0
ashutoshvarma
pyxpdf

Fast and memory-efficient Python PDF Parser based on xpdf sources

2K 44 17
stanford-oval
churro-ocr

CHURRO is an OCR toolkit for historical document transcription, built to make handwritten and printed sources readable at high accuracy and lower cost.

948 38 4
benjamin-awd
monopoly-sg

Monopoly is a Python library & CLI that converts bank statement PDFs to CSV.

870 128 41
moria97
fastpdf4llm

Lightweight and fast library to convert PDF to markdown format.

696 1 0
vpoulailleau
md-to-pdf

Yet another Markdown to PDF converter

690 2 0
Data from PyPI, GitHub, ClickHouse, and BigQuery