1,615 dependents
| Package | Description | Downloads/month |
|---|---|---|
| PyMuPDF4LLM | 20.7M | |
| PyMuPDF Layout turns PDFs into structured data 10× faster than vision-based tool... | 19.4M | |
| Framework for orchestrating role-playing, autonomous AI agents. By fostering col... | 6.8M | |
| Open source Python library converting pdf to docx. | 836K | |
| Commercial extensions for PyMuPDF; enables Office document handling, including d... | 429K | |
| Markdown to pdf renderer | 355K | |
| A python library to make filling pdfs much easier | 267K | |
| Alacorder retrieves case detail PDFs from Alacourt.com and processes them into d... | 101K | |
| Command line interface for quickly creating, authoring, and building PreTeXt doc... | 95K | |
| Transforms complex documents like PDFs and Office docs into LLM-ready markdown/J... | 77K | |
| Type stubs for PyMuPDF (fitz), automatically generated | 74K | |
| An autonomous agent that conducts deep research on any data using any LLM provid... | 74K | |
| Yet Another Document Translator | 68K | |
| Your AI second brain. Self-hostable. Get answers from the web or your docs. Buil... | 61K | |
| pdfCropMargins -- a program to crop the margins of PDF files | 42K | |
| Self-hosted semantic search and knowledge management for LLM-driven development | 40K | |
| Personal AI agent bot — Telegram + Ollama | 38K | |
| Robot Framework DocTest library. Simple Automated Visual Document Testing. | 38K | |
| 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG | 36K | |
| AI Data Vault - A query engine for AI Agents to securely query data from any dat... | 36K | |
| IAToolkit | 34K | |
| Python tool for converting files and office documents to Markdown. | 29K | |
| Research and development (R&D) is crucial for the enhancement of industrial prod... | 27K | |
| CLI AI Agent for code and documentation | 26K | |
| Agentic knowledgbase - your personal alexandira | 26K | |
| Securities Investment Analysis Tools (siat) | 26K | |
| Biblioteca para extração inteligente de documentos PDF com IA | 24K | |
| Planning, research, and report generation. | 23K | |
| Utilities that extend Standard Python. | 22K | |
| A faster cpu machine learning library | 19K | |
| This package is used for building automation functions for rpa within Bosch | 19K | |
| This tool has been deprecated. Use Agentic Document Extraction instead. | 19K | |
| Pulse Engine — Hybrid framework for building Pulse products | 18K | |
| MCP Server to integrate with SharePoint | 18K | |
| Open-source, AI-enhanced CAT tool with multi-LLM support, translation memory, gl... | 18K | |
| Convert documentation websites, GitHub repositories, and PDFs into Claude AI ski... | 18K | |
| [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 A... | 17K | |
| GLM-OCR: Accurate × Fast × Comprehensive | 16K | |
| Scientific paper search, metadata enrichment, PDF download, and BibTeX library m... | 16K | |
| fenic is a Python DataFrame library for processing text data with APIs inspired ... | 15K | |
| Nextcloud MCP Server | 15K | |
| A comprehensive Python framework for building and serving conversational AI agen... | 15K | |
| A easy way to create structured AI agents | 14K | |
| General-purpose open-source RAG engine with multi-LLM, hybrid retrieval, GraphRA... | 13K | |
| The library provides a set of tools and functions for various subject areas, inc... | 13K | |
| This package enables inference of header hierarchy in the docling PDF parsing pi... | 13K | |
| FERAL — Open-source AI agent with computer use, GenUI, voice, hardware control, ... | 13K | |
| Build, debug, evaluate, and operate AI agents. The only SDK with fork-and-rerun ... | 13K | |
| An intelligent agent framework with pluggable skills and LLM integrations | 13K | |
| Translate files using Argos Translate | 13K |