PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter them by metrics
izikeros
count-tokens

Count tokens in a text file.

46K 13 0
gweidart
rs-bpe

A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust

24K 38 5
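At its core, BPE works by repeatedly merging the most frequent adjacent pair of symbols into a new token. A minimal pure-Python sketch of that training loop, for illustration only (rs-bpe's actual implementation is in Rust and far more optimized):

```python
from collections import Counter

def bpe_train(text: str, num_merges: int) -> list[tuple[str, str]]:
    """Learn BPE merges: repeatedly fuse the most frequent adjacent pair."""
    tokens = list(text)  # start from individual characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:
            break  # nothing left worth merging
        merges.append((a, b))
        # Rewrite the token stream with the merged symbol.
        merged, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and tokens[i] == a and tokens[i + 1] == b:
                merged.append(a + b)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return merges

merges = bpe_train("low lower lowest low low", 10)
```

Real tokenizers train on byte sequences over a large corpus and store the merge table for fast encoding; the loop above only shows the core idea.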
cahya-wirawan
pyrwkv-tokenizer

A fast RWKV Tokenizer written in Rust

13K 54 5
cahya-wirawan
rwkv-tokenizer

A fast RWKV Tokenizer written in Rust

3K 54 5
kgruiz
pytokencounter

A simple Python library for tokenizing text and counting tokens. It currently supports only OpenAI LLMs, and helps with text processing and managing token limits in AI applications.

2K 2 0
stef41
toksight

Tokenizer analysis toolkit. Compare vocabulary coverage, compression ratios, and token boundaries across GPT-4o, Llama 3, Mistral, and any HuggingFace tokenizer.

695 1 0
DelvyG
promptminify

Minify LLM prompts to save tokens — domain-aware, tiktoken-validated, zero regressions

689 1 0
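The simplest form of prompt minification is stripping redundant whitespace, which costs tokens without changing meaning. A naive stdlib sketch of the idea (promptminify's domain-aware, tiktoken-validated approach is more sophisticated; this helper is an illustration, not its API):

```python
import re

def minify_prompt(prompt: str) -> str:
    """Naive prompt minification: trim lines, drop blank lines,
    and collapse internal runs of spaces/tabs."""
    lines = [line.strip() for line in prompt.splitlines()]
    lines = [line for line in lines if line]  # drop blank lines
    return "\n".join(re.sub(r"[ \t]+", " ", line) for line in lines)

before = "You are   a helpful assistant.\n\n\n  Answer    concisely.  \n"
after = minify_prompt(before)
```

A production minifier would also validate, with a real tokenizer, that the rewritten prompt actually uses fewer tokens and produces no regressions.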
Thibault00
runtoken

A blazing-fast BPE tokenizer for LLMs. Drop-in tiktoken replacement, 20-80x faster.

502 1 0
Reversehobo
openai-function-tokens

Predict the exact OpenAI token usage of functions.

414 19 1
Wolfe-Jam
slash-tokens

Token Optimization for Context Engineers. 4.8 KB WASM. Sub-millisecond. Zero dependencies.

209 4 0
unitythemaker
tokdu

tokdu (Token Disk Usage) is a terminal-based utility that helps you analyze and visualize token usage in your codebase. Similar to the classic du (disk usage) command, tokdu shows you how many tokens your files and directories consume, which is essential when working with Large Language Models (LLMs) that have token limits.

175 5 0
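A du-style token report boils down to walking the directory tree and estimating a token count per file. A rough sketch using the common ~4-characters-per-token heuristic (tokdu itself is a terminal UI backed by a real tokenizer; the helpers below are hypothetical, not its API):

```python
import os

def approx_tokens(text: str) -> int:
    """Rough token estimate: ~4 characters per token for English text."""
    return max(1, len(text) // 4) if text else 0

def token_usage(root: str) -> dict[str, int]:
    """Walk `root` and map each readable text file to an approximate token count."""
    usage = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                with open(path, encoding="utf-8") as fh:
                    usage[path] = approx_tokens(fh.read())
            except (UnicodeDecodeError, OSError):
                continue  # skip binary or unreadable files
    return usage

def report(root: str) -> list[tuple[str, int]]:
    """Largest consumers first, like `du | sort -rn`."""
    return sorted(token_usage(root).items(), key=lambda kv: -kv[1])
```

Swapping `approx_tokens` for a real tokenizer call gives exact counts at the cost of speed, which is why fast Rust tokenizers pair well with tools like this.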
ElmiraGhorbani
chatgpt-long-term-memory

The ChatGPT Long Term Memory package lets your projects handle large numbers of simultaneous users and external knowledge sources.

114 62 3
DevelopedBy-Siva
llm-tokenscope

Profile your LLM payloads. Find the waste. Cut the cost. Field-level token attribution, cost leak detection, and payload optimization for any LLM API.

109 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery