52 dependents
Package Description Downloads/month
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 14.6M
Google Cloud Client Libraries for Python 82K
A free, open-source expert system for guided interviews and document assembly, b... 46K
A free, open-source expert system for guided interviews and document assembly, b... 21K
This repository contains a Python program designed to execute Optical Character ... 19K
Vision utilities for web interaction agents 13K
"Document AI repo for data science" 10K
Collection of open-source libraries and tools for Robotic Process Automation (RP... 7K
ReMarkable (2 / PaperPro) comprehensive AI / Outlook / Notion Integration System... 5K
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 3K
Suffolk LIT Lab's bleeding edge version of docassemble, a free, open-source expe... 2K
A free, open-source expert system for guided interviews and document assembly, b... 2K
file2txt is a Python library that takes common file formats and turns them into plain... 1K
1K
A set of tools for capturing regions of the screen and detecting text within... 1K
Handprint text recognition in form documents. 1K
GCP offensive assessment and enumeration framework. 924
A Python tool for running OCR on Pokemon Screenshots using Google Cloud Vision 865
This repository contains a Python program designed to execute Optical Character ... 767
FlexiData is an open-source Python package designed for processing unstructured ... 720
yeonghoey yhy
Personal Automation CLI 679
textvision uses the Google Vision API to perform OCR and return text. 582
OCR, Archive, Index and Search: Implementation agnostic OCR framework. 509
AI-based Media and Misinformation Content Analysis Tool: Analyze text and images 470
BigQuery Semantic Search Orchestrator 464
python package to explore the color of language 394
Posting data of Ministers of India. The data is obtained by processing posting o... 391
Automating social science work around image tagging via various online services. 355
Extract quarterly EPS estimates from FactSet Earnings Insight reports using OCR 354
Autodistill Google Cloud Vision module for use in training a custom, fine-tuned ... 313
Multi-provider OCR library with MCP server support 275
Includes functions for grabbing Twitter images and adding labels 272
Donut packages for evaluating the output of data entry and redact. 270
A cross-platform, local-only vision assistant with OCR and AI analysis 265
An utterly useless package that imports everything for you. Now with top 1000 PyP... 247
Awesome document classification - Implementation of major techniques 202
Extracting information from DOCuments INTelligently. 199
yet another tool to help Japanese language learners read text in video games 185
Xtracture is an open source library designed to efficiently extract arbitrary el... 180
Get your documents ready for gen AI 179
Extract quarterly EPS estimates from FactSet Earnings Insight reports using OCR ... 173
This repository contains a Python program designed to execute Optical Character ... 163
Utilities for text detection & document processing 144
Python package with not so common data wrangling functions and API wrappers. 141
A unified, authenticated multi-cloud utility package 121
Google OCR MCP server 107
Find the best ML model for your use case | Y Combinator Fall 2024 99
94
E-commerce data extraction and processing platform with AI-powered enrichment 69
Get your documents ready for gen AI 66