A data processing and analysis pipeline for data transformation, quality assessment, deduplication, and formatting jobs. The pipeline is configured and executed through YAML configuration files.
A package for running quality analyses on machine learning models.
A simple widget for interactive EDA/QA, for pandas users working in Jupyter Notebook.