45 dependents
Package Description Downloads/month
Optical character recognition for Japanese text, with the main focus being Japan... 20K
Interface for OuteTTS models. 7K
特許明細書の自動方式チェックツール(CLI/GUI/MCP対応) 3K
2K
Automated Japanese vocabulary mining from anime subtitles. 2K
OCR application for reading manga in Japanese, made for AJATTers 🇯🇵 . 2K
Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporar... 2K
Local-LLM is a llama.cpp server in Docker with OpenAI Style Endpoints. 1K
A collection of all our phonemeizers for dataset construction and inference 1K
UniDic2UD + COMBO-pytorch wrapper for spaCy 1K
A utility to extract vocabulary lists from manga. 856
Plugins to enable usage of HuggingFace Models in ocr_translate 796
Calculate readability scores for Japanese texts. 680
Emacs Annotation and Language Learning tool. 667
A lightweight framework for evaluating visual-language models. 557
A utility to extract vocabulary lists from manga. 540
Align kanji lyrics with romaji karaokes 473
Make your epub books vertical or horizontal. 450
424
Search tool for Japanese text in EPUB and Mokuro files 322
Toolbox for Japanese text. 305
MeCab to pandas 297
Torchless optical character recognition for manga focused Japanese text, lightwe... 291
Edge-based voice assistant using Gemma LLM with STT and TTS capabilities 285
LLM Japanese Kana-Kanji convetor 240
MeloPlus: Advanced Python Library for MeloTts 225
Multilingual phonetic-similarity replacement engine — a proper-noun substitution... 218
Japanese ebook audio subtitle aligner - Create synchronized subtitles from Japan... 196
Scalable Data Preprocessing Tool for Training Large Language Models 185
extracts kanji sentences using ocr for automatic anki flashcards 184
A text classification toolkit 180
Meloplus: Advanced python library for Melotts 165
A Japanese-enhanced semantic search system for your local documents. 148
P2x model wrapper for corporate injection 145
自動更新型日本語新語辞書ライブラリ 132
127
Scikit-learn compatible Japanese text vectorizer for CPU-based Japanese natural ... 113
Provides a minimal PyTorch implementation of SPLADE 113
NViXTTS_pl 103
Scalable data pre processing and curation toolkit for LLMs 71
recognized text scoreing library 67
46
36
Scalable Data Preprocessing Tool for Training Large Language Models 1
1