Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters and massive LLMs.
A Python port of the Fredriksen–Jahren Lexicon Classifier
Lightweight Python library for scraping data via the Twitter search API.