62 dependents
| Package | Description | Downloads/month |
|---|---|---|
| | Democratizing AI scientists with ToolUniverse | 126K |
| | "Document AI repo for data science" | 10K |
| | Resolved the issue of incorrect quantity extraction in OCR text caused by a wate... | 4K |
| | This repository contains a Python program designed to extract Optical Character ... | 1K |
| | FREDtools repository | 1K |
| | A library for office operator | 728 |
| | Open source Python CLI toolkit for conversion, manipulation, analysis of files (... | 576 |
| | PDF2TEXT LIBRARY: Update extract condition (image/no image) | 467 |
| | Unified asynchronous utility library supporting file, database, time, logging, and other operations | 407 |
| | A custom Flask package with PDF processing tools | 309 |
| | | 289 |
| | Comprehensive MCP server exposing dozens of capabilities to AI agents: multi-pro... | 257 |
| | | 220 |
| | Package to save CSV as PNG | 213 |
| | | 182 |
| | Extracts images from PDFs, stores them in S3, and retrieves based on keyword sea... | 171 |
| | | 153 |
| | Topic-aware semantic search for legal documents based on a vectorstore, and... | 143 |
| | A Python package to compare files (PDF, docx, images) and generate reports in tx... | 137 |
| | | 133 |
| | A microservice using FastAPI, PostgreSQL, OpenSearch, and LangChain. | 132 |
| | Image converter and resizer for research publication. | 132 |
| | | 128 |
| | | 128 |
| | | 125 |
| | | 123 |
| | Proteus data extractor File | 122 |
| | | 121 |
| | | 119 |
| | | 118 |
| | | 113 |
| | | 113 |
| | | 113 |
| | A library for converting PDF to HTML and vice versa | 109 |
| | | 103 |
| | | 102 |
| | | 102 |
| | | 100 |
| | | 99 |
| | | 96 |
| | | 94 |
| | | 92 |
| | | 91 |
| | | 84 |
| | Convert PDF files to Markdown with OCR support | 82 |
| | Topic-aware semantic search for legal documents based on a vectorstore, and... | 82 |
| | | 81 |
| | | 70 |
| | Tired of main_fixed.py? File summarizer and context manager for AI agents. / Say goodbye to m... | 70 |
| | | 66 |