PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
breezedeus
cnocr

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】

70K 4K 537
ankandrew
fast-plate-ocr

Lightweight & fast OCR models for license plate text recognition.

22K 551 71
StabRise
scaledp

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

5K 18 1
SakuraMathcraft
mathcraft-ocr

A Windows math workspace for screenshot OCR, handwriting-to-LaTeX, editing, preview, and symbolic computation, powered by MathCraft OCR and MathLive.

2K 155 14
LATIS-DocumentAI-Group
documentai-std

DocumentAI-std is a Python library designed to facilitate and standardize document analysis and processing tasks. It offers functionality for handling document elements, performing optical character recognition (OCR), and managing document datasets.

2K 3 0
Navaneeth-Sharma
aksharajaana

A OCR Project for Reading New and Old Kannada Texts

2K 10 3
pk5ls20
easypaddleocr

A simple, optional tool for PaddleOCR Detection, direction classification and recognition on CPU and GPU using torch.

1K 13 2
shibing624
imgocr

Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。

1K 131 21
gnana70
ocr-tamil

Python Tamil OCR package

1K 86 15
maxent-ai
ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

509 225 11
Anish-M-code
pdftotext3

A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.

342 22 2
sethupavan12
llm-markdownify

Convert PDFs, images to high-quality Markdown using Vision LLMs.

327 21 1
FREDERICO23
docling-ocr

A powerful Python package for extracting text from images and documents using the SmolDocling-256M-preview advanced LLM-based models.

321 12 1
sxaxmz
handle-scanned-pdf

No description available

302 0 1
Danielnara24
mistral-ocr-gui

A graphical user interface for processing images with the Mistral OCR API.

284 1 0
PSPDFKit
nutrient-dws

Python client library for Nutrient Document Web Services API

264 54 0
VerisimilitudeX
ocr-pdf2txt

Use Optical Character Recognition technology to convert scanned PDFs into TXT files locally.

210 1 0
AbsoluteWinter
vocr

Vietnamese OCR

194 0 0
kfur
fineocr

FineScanner Mobile OCR for free

186 2 0
lollococce
pdfer

A Python library to handle the transformation from PDFs to data

140 1 0
jcspeegs
loups

Extract video chapter timestamps and title screens using template matching and OCR - perfect for sports, podcasts, and content creation

128 0 0
tjkessler
tesseract-positional

Tool to save positional OCR data to a text file

122 0 0
sergiocorreia
quipucamayoc

dev repo for article

119 33 5
snakers4
silero-ocr

Simple optical character recognition (OCR) by Silero

99 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery