Light-weight tool for normalizing whitespace and accurately tokenizing words (no regex). Multiple natural languages supported.