13 dependents
Package Description Downloads/month
🤗 AutoTrain Advanced 34K
Advanced Machine Learning Training Platform - IN DEVELOPMENT 3K
A SapientML plugin of SapientMLGenerator 2K
Tokeniser toolkit: a collection of Pythonic subword tokenisers and text preproce... 2K
UniDic2UD + COMBO-pytorch wrapper for spaCy 1K
A SapientML plugin of preprocess CodeBlockGenerator 643
NLP data augmentation tool 304
Custom pretokenizers for Japanese language models 233
A text classification toolkit 180
Scikit-learn compatible Japanese text vectorizer for CPU-based Japanese natural ... 113
SoftMatcha 112
An util package for myself. (Mostly some classes and functions for NLP) 107
A library of processes for manipulating tabular data in CSV format 72