PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
pdftl
pdftl

PDF CLI pipeline: merge, split, crop, rotate, compress, extract images, add text and more. Modern pdftk replacement, powered by pikepdf/qpdf.

4K 5 1
Prathamesh-Ghatole
entityxtract

A provider-agnostic, entity-centric LLM-powered document entity extraction tool

562 1 1
mcagriaksoy
safepdf

SafePDF is a privacy-focused offline tool for PDF manipulation. Merge, compress, split, and organize your PDF files securely: No internet required, your documents stay local and safe.

463 6 1
Rekhet
revpdf

A triage-and-recovery toolkit for PDFs saved with incremental updates.

317 0 0
PSPDFKit
nutrient-dws

Python client library for Nutrient Document Web Services API

283 54 0
fujiba
llm-pdf-chunker

LLM-friendly PDF splitter & image optimizer. Chunk PDFs by size and downsample images for RAG/Bedrock.

281 0 0
hksorensen
diagram-detector

Production-ready diagram detection for academic papers using YOLO11

273 1 0
MelinaNorton
journal-vetter

Python CLI & library for automated journal vetting — GPT‑4.1 summarization, YAML configuration, reproducible analysis.

135 1 0
Aleptonic
pdf-snip

A package to help manage pdf pages, images and their conversions during different NLP, CV or other tasks to avoid repetitive code blocks and give a simple function call to make it happen

96 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery