1,615 dependents
Package Description Downloads/month
PyMuPDF4LLM 20.7M
PyMuPDF Layout turns PDFs into structured data 10× faster than vision-based tool... 19.4M
Framework for orchestrating role-playing, autonomous AI agents. By fostering col... 6.8M
Open source Python library converting pdf to docx. 836K
Commercial extensions for PyMuPDF; enables Office document handling, including d... 429K
Markdown to pdf renderer 355K
A python library to make filling pdfs much easier 267K
Alacorder retrieves case detail PDFs from Alacourt.com and processes them into d... 101K
Command line interface for quickly creating, authoring, and building PreTeXt doc... 95K
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/J... 77K
Type stubs for PyMuPDF (fitz), automatically generated 74K
An autonomous agent that conducts deep research on any data using any LLM provid... 74K
Yet Another Document Translator 68K
Your AI second brain. Self-hostable. Get answers from the web or your docs. Buil... 61K
pdfCropMargins -- a program to crop the margins of PDF files 42K
Self-hosted semantic search and knowledge management for LLM-driven development 40K
Personal AI agent bot — Telegram + Ollama 38K
Robot Framework DocTest library. Simple Automated Visual Document Testing. 38K
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG 36K
AI Data Vault - A query engine for AI Agents to securely query data from any dat... 36K
IAToolkit 34K
Python tool for converting files and office documents to Markdown. 29K
Research and development (R&D) is crucial for the enhancement of industrial prod... 27K
CLI AI Agent for code and documentation 26K
Agentic knowledge base - your personal Alexandria 26K
Securities Investment Analysis Tools (siat) 26K
Library for intelligent extraction of PDF documents with AI 24K
Planning, research, and report generation. 23K
Utilities that extend Standard Python. 22K
A faster cpu machine learning library 19K
This package is used for building automation functions for rpa within Bosch 19K
This tool has been deprecated. Use Agentic Document Extraction instead. 19K
Pulse Engine — Hybrid framework for building Pulse products 18K
MCP Server to integrate with SharePoint 18K
Open-source, AI-enhanced CAT tool with multi-LLM support, translation memory, gl... 18K
Convert documentation websites, GitHub repositories, and PDFs into Claude AI ski... 18K
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - based on A... 17K
GLM-OCR: Accurate × Fast × Comprehensive 16K
Scientific paper search, metadata enrichment, PDF download, and BibTeX library m... 16K
fenic is a Python DataFrame library for processing text data with APIs inspired ... 15K
Nextcloud MCP Server 15K
A comprehensive Python framework for building and serving conversational AI agen... 15K
An easy way to create structured AI agents 14K
General-purpose open-source RAG engine with multi-LLM, hybrid retrieval, GraphRA... 13K
The library provides a set of tools and functions for various subject areas, inc... 13K
This package enables inference of header hierarchy in the docling PDF parsing pi... 13K
FERAL — Open-source AI agent with computer use, GenUI, voice, hardware control, ... 13K
Build, debug, evaluate, and operate AI agents. The only SDK with fork-and-rerun ... 13K
An intelligent agent framework with pluggable skills and LLM integrations 13K
Translate files using Argos Translate 13K