PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
polm
fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.

567K 518 39
polm
unidic

Unidic packaged for installation via pip.

370K 109 13
polm
unidic-lite

A small version of UniDic for easy pip installs.

336K 50 3
taishi-i
nagisa

A Japanese tokenizer based on recurrent neural networks

231K 417 23
polm
cutlet

Japanese to romaji converter in Python

89K 376 22
nagataaaas
kanjize

Kanjize(カンジャイズ): Easy converter between Kanji-Number and Integer

63K 68 4
bpwhelan
gamesentenceminer

An immersion toolkit for learning Languages through games and other visual media.

50K 604 34
letuananh
chirptext

ChirpText is a collection of text processing tools for Python.

32K 7 3
tsukumijima
pyopenjtalk-plus

pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements

30K 57 4
polm
posuto

🏣📮〠 Japanese postal code data.

25K 226 10
kha-white
manga-ocr

Optical character recognition for Japanese text, with the main focus being Japanese manga

20K 3K 133
Byaidu
pdf2zh

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

17K 33K 3K
FlippFuzz
ai-sub

AI-Powered Subtitle Generation with Translation

9K 12 0
rtr46
meikiocr

high-speed, high-accuracy, local ocr for japanese video games

6K 75 3
opencollector
jntajis-python

A fast character conversion and transliteration library based on the scheme defined for Japan National Tax Agency (国税庁) 's corporate number (法人番号) system.

6K 21 0
neosapience
typecast-python

The official SDK for the Typecast API. (Python, JS/TS, C/C++, C#, Java, Kotlin, Go, Rust, Swift, Zig, PHP)

4K 6 0
tsukumijima
fugashi-plus

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis with additional improvements.

4K 4 0
cihai
unihan-etl

Export UNIHAN's database to csv, json or yaml

3K 65 14
34j
account-codes-jp

e-Tax / EDINETタクソノミ / 青色申告 の勘定科目(コード)表のラッパー (非公式)

3K 0 0
miurahr
unihandecode

unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language preference priorities

3K 69 9
aki0ka
meisai-checker

特許明細書の自動方式チェックツール(CLI/GUI/MCP対応)

3K 0 0
kha-white
mokuro

Read Japanese manga inside browser with selectable text.

3K 2K 106
KoichiYasuoka
esupar

Tokenizer POS-tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages

3K 56 7
neocl
jamdict

Python 3 library for manipulating Jim Breen's JMdict, KanjiDic2, JMnedict and kanji-radical mappings

2K 168 18
    • Data from PyPI, GitHub, ClickHouse, and BigQuery