Amharic tokenizer with BPE-like merges over decomposed fidel (Cython)
A modular toolkit for cleaning and normalizing Amharic text.