PyPI Stats

Python Packages for the ocr-python Topic

Python packages with the GitHub topic ocr-python. Sorted by relevance, with stars and monthly downloads.
breezedeus
cnocr

CnOCR: Awesome Chinese/English OCR Python toolkit based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. (A Chinese/English OCR Python package based on PyTorch/MXNet.)

71K 4K 537
ankandrew
fast-plate-ocr

Lightweight & fast OCR models for license plate text recognition.

23K 551 71
StabRise
scaledp

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

5K 18 1
SakuraMathcraft
mathcraft-ocr

A Windows math workspace for screenshot OCR, handwriting-to-LaTeX, editing, preview, and symbolic computation, powered by MathCraft OCR and MathLive.

2K 155 14
LATIS-DocumentAI-Group
documentai-std

DocumentAI-std is a Python library designed to facilitate and standardize document analysis and processing tasks. It offers functionality for handling document elements, performing optical character recognition (OCR), and managing document datasets.

2K 3 0
Navaneeth-Sharma
aksharajaana

An OCR project for reading new and old Kannada texts

2K 10 3
shibing624
imgocr

Python 3 package for Chinese/English OCR, using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference is based on the ppocr-v5-onnx model, an open-source SOTA for Chinese/English OCR.

1K 131 21
gnana70
ocr-tamil

Python Tamil OCR package

1K 86 15
pk5ls20
easypaddleocr

A simple, optional tool for PaddleOCR Detection, direction classification and recognition on CPU and GPU using torch.

1K 13 2
maxent-ai
ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

562 225 11
sethupavan12
llm-markdownify

Convert PDFs, images to high-quality Markdown using Vision LLMs.

425 21 1
FREDERICO23
docling-ocr

A powerful Python package for extracting text from images and documents using the SmolDocling-256M-preview advanced LLM-based models.

398 12 1
Danielnara24
mistral-ocr-gui

A graphical user interface for processing images with the Mistral OCR API.

361 1 0
Anish-M-code
pdftotext3

A simple pdftotext conversion tool for Windows 8.1/10/11 and Fedora/Ubuntu/Debian/Arch-based Linux distros, using poppler-utils and Google's tesseract-ocr.

336 22 2
sxaxmz
handle-scanned-pdf

No description available

303 0 1
PSPDFKit
nutrient-dws

Python client library for Nutrient Document Web Services API

282 54 0
VerisimilitudeX
ocr-pdf2txt

Use Optical Character Recognition technology to convert scanned PDFs into TXT files locally.

225 1 0
AbsoluteWinter
vocr

Vietnamese OCR

199 0 0
kfur
fineocr

FineScanner Mobile OCR for free

180 2 0
lollococce
pdfer

A Python library to handle the transformation from PDFs to data

141 1 0
jcspeegs
loups

Extract video chapter timestamps and title screens using template matching and OCR - perfect for sports, podcasts, and content creation

138 0 0
sergiocorreia
quipucamayoc

dev repo for article

127 33 5
tjkessler
tesseract-positional

Tool to save positional OCR data to a text file

122 0 0
kartikgill
taco-box

An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR

101 15 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery