48 dependents
Description Downloads/month
Simple package to extract text with coordinates from programmatic PDFs 2.7M
This package contains the AI models used by the Docling PDF conversion package 2.4M
Get your documents ready for gen AI 206K
Making docling agentic through MCP 41K
llama-index node_parser docling integration 31K
Running Docling as an API service 29K
llama-index readers docling integration 22K
Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling - Minimal d... 13K
Interact with the Deep Search platform for new knowledge explorations and disco... 8K
Python library for Synthetic Data Generation 7K
InstructLab Core package. Use this to chat with a model and execute the Instruc... 7K
VibeSurf: A powerful browser assistant for vibe surfing 5K
Agentic web research tool. Smarter than search, faster than deep research. Searc... 4K
Transform unstructured documents into validated, rich and queryable knowledge gr... 3K
A RAG framework for evaluation 2K
Privacy-first document intelligence engine — converts PDFs, DOCX, PPTX, XLSX, an... 2K
Evaluation of Docling 2K
Extract structured Markdown, tables, figures, and equations from scientific PDFs... 1K
A comprehensive PDF processing toolkit that converts PDFs to markdown with advan... 974
Agent that reads, writes, and edits documents. 876
PaperQA readers implemented using Docling 876
A Multiagent Framework for Generating Multimodal Multihop QA Datasets for RAG Ev... 839
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document represe... 646
A tool for uploading and embedding documents for RAG systems 597
CLI Indexer for Jira, Confluence and Local Files, making knowledge available for... 495
Knowledge Base MCP 475
A set of tools to create synthetically-generated data from documents 417
MCP Server for zmp knowledge base 362
CVAT annotation tools for Docling document processing and evaluation 323
AI Search tools. 303
A Python package with a built-in web application 290
Intelligent PDF file renaming using LLMs (OpenAI, Ollama, etc.) 269
Build document-native LLM applications 267
OptimisedRAG: Simple and Fast Retrieval-Augmented Generation a modified version ... 224
Get your documents ready for gen AI 179
Aliyun Bailian powered audio/video to DoclingDocument transcriber 160
A versatile OCR and document processing command-line tool. 142
A basic document parsing utility. (Markdown/HTML Conversion) 133
Config-driven multi-agent orchestration kit with optional API/UI/MCP extras 97
This is a repository dedicated to curating knowledge on any topic related to you... 86
Langflow is a powerful tool for building and deploying AI-powered agents and wor... 85
A Python library for business PDF related content analysis 78
A Python package with a built-in web application 75
Get your documents ready for gen AI 71
Get your documents ready for gen AI 71
Get your documents ready for gen AI 66
Convert born-digital PDFs into Foundry VTT v13 module compendia. 1
1