PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
konstantint
passporteye

Extraction of machine-readable zone information from passports, visas and id-cards via OCR

13K 446 122
sivakumar-mahalingam
fastmrz

⚡ Extracting the Machine Readable Zone (MRZ) from passports or any document images

4K 176 39
amenezes
aiopytesseract

A Python asyncio wrapper for Tesseract-OCR.

2K 27 7
Navaneeth-Sharma
aksharajaana

An OCR Project for Reading New and Old Kannada Texts

2K 10 3
bhimrazy
receipt-ocr

An efficient OCR engine for receipt image processing.

1K 228 45
Lucs1590
nkocr

A module for performing specific OCR tasks on food products and nutritional tables.

628 39 11
techbyvj
platerecognizepy

License plate recognition library for Python

597 3 0
icaropires
pdf2dataset

Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features

596 19 5
maxent-ai
ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

509 225 11
Anish-M-code
pdftotext3

A simple pdftotext conversion tool for Windows 8.1/10/11 and Fedora/Ubuntu/Debian/Arch-based Linux distros using poppler-utils and Google's tesseract-ocr.

342 22 2
sxaxmz
handle-scanned-pdf

No description available

302 0 1
Saransh-cpp
ocred

Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials

297 16 3
StabRise
pyspark-pdf

PDF DataSource for Apache Spark that reads PDF files directly into a DataFrame and runs OCR on them

291 81 4
asiff00
bangla-pdf-ocr

Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations.

227 21 3
stefan6419846
tesstrain

Tesseract training utilities (Python package)

220 1 1
FriedrichFroebel
ocrodjvu

OCR for DjVu (Python 3 port)

212 10 1
LaurenceWarne
pdf-question-spacer

No description available

173 0 0
ianzhao05
textshot

Python tool for grabbing text via screenshot

157 2K 256
aptakhin
unifex

Unified extraction library for PDF, OCR, and LLM-based document processing

131 0 0
tjkessler
tesseract-positional

Tool to save positional OCR data to a text file

122 0 0
kamidipreetham
verifytweet

A tool to verify Tweet screenshots

111 20 1
phamxtien
streamlit-tesseract-scanner

OCR scanner using Tesseract

81 9 0
AlfredoCubitos
scan2folder

Enables HP-Scan-to-Folder button on Linux

69 2 1
nikhilkumarsingh
pyinrail

Python Wrapper for Indian Railways Enquiry API

41 47 25
    • Data from PyPI, GitHub, ClickHouse, and BigQuery