148 dependents
Package Description Downloads/month
Lossless conversion of raster images to PDF. 1.9M
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be search... 835K
:fire: The Python library for PDF forms. 130K
Google Cloud Client Libraries for Python 82K
A general day-to-day toolset for PKScreener repos 46K
A free, open-source expert system for guided interviews and document assembly, b... 46K
Simple python wrapper to convert HTML to PDF with headless Chrome via selenium 45K
Base OAREPO package freezeing versions of libraries 39K
:fire: The Python library for PDF forms. 36K
A free, open-source expert system for guided interviews and document assembly, b... 21K
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 A... 17K
An alternative implementation of RML 13K
Convert PDF files to the archival PDF/A format. Supports PDF/A-2b, 2u, 3b, 3u wi... 9K
Ad removal tool for PDFs. 6K
Load networks 6K
A multi format lossless image optimizer that uses external tools 6K
PDF CLI pipeline: merge, split, crop, rotate, compress, extract images, add text... 4K
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user... 2K
ClicheFactory SDK 2K
Parse bank statement PDFs, extract transactions, and persist to Parquet and SQLi... 2K
Suffolk LIT Lab's bleeding edge version of docassemble, a free, open-source expe... 2K
Fetch an academic paper or web article and send it to the reMarkable tablet with... 2K
PDF.js-based PDF viewer widget for PySide6 with annotation support 2K
PDF.js-based PDF viewer widget for PyQt6 with annotation support 2K
Annex IV-as-Code CLI: generate & validate EU AI Act Annex IV with legal complian... 2K
A PyTorch library for multi-modal image translation with diffusion bridges, GANs... 2K
A free, open-source expert system for guided interviews and document assembly, b... 2K
pd supercharges your development workflows 2K
Structure-based PDF Steganography tool 2K
Format-preserving PDF text editing — edit text in existing PDFs while preserving... 1K
Cross-platform Python client for ARX CoSign electronic signatures via SOAP API 1K
A high-performance PDF processing library with a permissive license. 1K
Interactive TUI for exploring PDF object structure 1K
Tool for analysis of security certificates and their security targets (Common Cr... 1K
Aggressive, safety-first PDF shrinker for scanned medical/legal documents 1K
A tool for learning about and pre-processing forms 1K
Hobby Project GUI for the Python Program 'OCRmyPDF' by James R. Barlow 1K
Viewer for XJustiz files used for file / data transfers between courts, lawyers,... 1K
Plugin to run OCRmyPDF with Apple Vision/VisionKit Framework OCR engine (a.k.a. ... 1K
edit logic numbering and clickable index for existing pdf lacking it based on a ... 1K
Universal AI Agent supporting multiple LLM providers (Anthropic, OpenAI, Gemini,... 1K
Convert OCRized PDF to text using [OCRmyPDF] 1K
Collection of tools to check uploaded scans and records for identifiable data 980
963
Code repository for PDFStitcher, a utility to stitch together and modify line pr... 963
AI-powered CLI for analyzing hardware engineering documents 937
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx... 839
Simple CLI tool for decrypting PDF files. `uvx decryptpdf my.pdf` `pipx run decr... 785
Open-source Python tool to parse credit card PDF statements from Indian banks (H... 698
Host your own local PDF server applying OCR and duplex scanning on your document... 691