PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
megagonlabs
ginza-transformers

Use custom tokenizers in spacy-transformers

34K 16 5
gweidart
rs-bpe

A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust

24K 38 5
Prismadic
llm-magnet

the small distributed language model toolkit. fine-tune state-of-the-art LLMs anywhere, rapidly.

1K 32 4
1kkiRen
tokenizerchanger

Library for manipulating the existing tokenizer.

590 21 1
Hugging-Face-Supporter
tftokenizers

Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels

304 10 4
bimri
precious-nlp

A tokenizer-free NLP library with T-FREE, CANINE, and byte-level approaches

188 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery