Accessible, efficient data preprocessing library for pretrain and SFT datasets, including KL3M
DublinCore Python library with zero dependencies