PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
jaidedai
easyocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

2.7M 29K 4K
sirfz
tesserocr

A Python wrapper for the tesseract-ocr API

365K 2K 259
mindee
python-doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

287K 6K 641
felixdittrich92
onnxtr

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

75K 178 18
amenezes
aiopytesseract

A Python asyncio wrapper for Tesseract-OCR.

2K 27 7
gnana70
ocr-tamil

Python Tamil OCR package

1K 86 15
caltechlibrary
handprint

Run handwritten text recognition services on images of documents

781 189 18
OmarSamirz
iftg

IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layouts. IFTG supports all languages and offers endless noise combinations, including custom noise creation.

666 21 2
by256
imagedataextractor

ImageDataExtractor 2.0 - a Python library for electron microscopy image quantification.

612 21 2
bandrel
ocyara

A Yara rule engine that scans images for matches using Optical Character Recognition (OCR). See the Github page for more information about the Cython, Tesseract, and Leptonica prerequsites.

267 42 8
MartinThoma
hasy

HASY dataset

264 35 11
acsenrafilho
cucaracha

A bureaucratic cockroach (cucaracha) assistent to help in document processing and analysis

259 1 1
JaidedAI
easyocr-itgn

Modified Easyorc By IntoThatGoodNight

252 29K 4K
verifid
mocr

Meaningful Optical Character Recognition from identity cards with Deep Learning.

244 25 6
cneud
alto-tools

Perform various operations on ALTO xml files

228 49 16
olaflaitinen
thulium-htr

Thulium is a production-ready Python library for offline handwritten text recognition (HTR) supporting 52+ languages across Latin, Cyrillic, Greek, Arabic, Hebrew, Devanagari, Chinese, Japanese, Korean, and Georgian scripts.

204 8 0
khasbilegt
numiner

NUM Miner (Tool to create open dataset for Handwritten Text Recognition)

199 4 0
jaidedai
nocv2easyocr

This is a fork of the EasyOCR library without the opencv requirement

180 29K 4K
jaidedai
asone-ocr

End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution

178 29K 4K
jaidedai
myeasyocr

End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution

139 29K 4K
marieai
marie-ai

Python library to Integrate AI-powered features into your applications

136 89 11
18520339
tfseqrec

TensorFlow 2 toolkit for Sequence-level Text Recognition with modules that simplify the steps to process & visualize sequence data, along with common recognition loss functions & evaluation metrics

107 2 0
snakers4
silero-ocr

Simple optical character recognition (OCR) by Silero

99 0 0
kartikgill
taco-box

An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR

99 15 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery