PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Feature Engineering Python Packages

Python packages with the GitHub topic feature-engineering. Sorted by relevance, with stars and monthly downloads.
feature-engine
feature-engine

Feature engineering and selection open-source Python library compatible with sklearn.

344K 2K 342
alteryx
featuretools

An open source python library for automated feature engineering

219K 8K 906
apache
sf-hamilton

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

170K 2K 187
upgini
upgini

Intelligent data search & enrichment for Machine Learning

77K 347 27
Microsoft
nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

35K 14K 2K
fraunhoferportugal
tsfel

An intuitive library to extract features from time series.

34K 1K 156
EpistasisLab
tpot

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

32K 10K 2K
winedarksea
autots

Automated Time Series Forecasting

25K 1K 123
predict-idlab
tsflex

Flexible time series feature extraction & processing

24K 438 28
dagworks-inc
sf-hamilton-sdk

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

15K 2K 186
gmrukwa
divik

Divisive Intelligent K-Means algorithm (DiviK) for joint feature selection and clustering of heavily multidimensional data.

15K 14 6
apache
apache-hamilton

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

14K 2K 187
alteryx
evalml

EvalML is an AutoML library written in python.

11K 847 93
dagworks-inc
sf-hamilton-ui

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

10K 2K 186
ThomasBury
arfs

All Relevant Feature Selection

10K 143 15
dagworks-inc
sf-hamilton-lsp

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

10K 2K 186
AutoViML
featurewiz

Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshadri. Collaborators welcome.

10K 677 99
mljar
mljar-supervised

Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

10K 3K 432
pixeltable
pixeltable

Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

10K 2K 210
MatsMoll
aligned

The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt

9K 61 2
NVIDIA-Merlin
nvtabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

9K 1K 149
scikit-learn-contrib
fastcan

A fast canonical-correlation-based search algorithm for feature selection, system identification, data pruning, etc.

7K 23 5
alibaba
feathub-nightly

A stream-batch unified feature store for real-time machine learning

7K 347 60
SimonBlanke
hyperactive

A unified interface for optimization algorithms and experiments

7K 552 75
    • Data from PyPI, GitHub, ClickHouse, and BigQuery