33 dependents
Package Description Downloads/month
Document reader with OCR & image detection support. 3K
Librairie outils IA Lexia par Lexfluent 3K
Plagiarism Detection. Simplified. 3K
A tool to remove editorial material from advance sheets. 2K
Open source document management system for digital archives 2K
Ingest sources with proper citation — PDF, URL, media, Office, DJVU 1K
A tool for learning about and pre-processing forms 1K
Hobby Project GUI for the Python Program 'OCRmyPDF' by James R. Barlow 1K
Plugin to run OCRmyPDF with Apple Vision/VisionKit Framework OCR engine (a.k.a. ... 1K
Convert OCRized PDF to text using [OCRmyPDF] 1K
Python library for document preprocessing and information extraction 918
aeiva is a general AI agent framework 693
Host your own local PDF server applying OCR and duplex scanning on your document... 691
OCRmyPDF plugin to generate page SVG files and preview images 644
File Conversor is a Python-based CLI tool to convert, compress, and manipulate f... 608
PDF Processor - a GUI for some common PDF operations. 504
OCRmyPDF plugin using Google Lens API for OCR 450
PaddleOCR engine plugin for OCRmyPDF 432
Chat with GPT in the terminal. 379
Open-source tool for accurate & fast scientific literature data extraction with ... 372
Extracts citations from PDF, URLs and local media files in CSL-JSON. 369
Plugin to run OCRmyPDF with the EasyOCR engine 356
Wowool PDF to Text 315
A PDF pipeline to convert, OCR, and merge documents. 308
To be available soon 262
A module for managing zotfiles files 258
Compress and OCR PDFs in a simple GUI. 247
PIH Recognize service package 210
Intelligent research paper analysis pipeline with LLM-driven categorization 197
PDF text, table, image, and form extraction utilities 116
109
expose a single interface and API to few OCR tools 107
Convert PDF to markdown + JSON quickly with high accuracy 1