PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
chrismattmann
tika

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

410K 2K 251
mindee
python-doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

287K 6K 641
felixdittrich92
onnxtr

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

75K 178 18
felixdittrich92
docling-ocr-onnxtr

OnnxTR OCR plugin for Docling

26K 19 0
GitHub30
winocr

Windows.Media.Ocr

10K 25 12
rtr46
meikiocr

high-speed, high-accuracy, local ocr for japanese video games

6K 75 3
open-mmlab
mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

5K 5K 777
vkit-x
vkit-nightly

Boosting Document Intelligence

4K 23 1
sivakumar-mahalingam
fastmrz

⚡Extracting the Machine Readable Zone (MRZ) from passport or any document images

4K 176 39
Belval
trdg

A synthetic data generator for text recognition

3K 4K 1K
LATIS-DocumentAI-Group
documentai-std

DocumentAI-std is a Python library designed to facilitate and standardize document analysis and processing tasks. It offers functionality for handling document elements, performing optical character recognition (OCR), and managing document datasets.

2K 3 0
clovaai
synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

1K 575 109
Sinapsis-AI
sinapsis-ocr

Sinapsis templates supporting different OCR techniques

898 20 8
Sinapsis-AI
sinapsis-easyocr

Sinapsis templates supporting different OCR techniques

876 20 8
Sinapsis-AI
sinapsis-doctr

Perform optical character recognition using the DocTR library

677 20 8
AI-Riksarkivet
htrflow

HTRflow is the underlying engine for our HTR-pipeline

598 73 12
SeldonHZ
torchfree-ocr

This package is EasyOCR-based optical character recognition. Unlike EasyOCR, the package uses a pre-saved with onnx language models, so it doesn't need a 1-2 Gb pytorch dependency. This is particularly useful for developing and packaging light-weight applications that utilize text recognition.

584 14 5
mindspore-lab
mindocr

A toolbox of OCR models and algorithms based on MindSpore.

544 300 62
Sinapsis-AI
sinapsis-deepseek-ocr

Sinapsis templates supporting different OCR techniques

536 20 8
Sinapsis-AI
sinapsis-glm-ocr

Templates for optical character recognition using the GLM-OCR model

424 20 8
tungedng2710
ton-ocr

Text recognition

271 3 0
18520339
tfseqrec

TensorFlow 2 toolkit for Sequence-level Text Recognition with modules that simplify the steps to process & visualize sequence data, along with common recognition loss functions & evaluation metrics

107 2 0
ajkdrag
ocrtoolkit

Parse bank cheques

91 102 4
mindspore-lab
opensourcedot-mindocr

A toolbox of ocr models and algorithms based on MindSpore

72 300 62
    • Data from PyPI, GitHub, ClickHouse, and BigQuery