PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Machine Learning Python Packages

Python packages with the GitHub topic machine-learning. Sorted by relevance, with stars and monthly downloads.
huggingface
huggingface-hub

The official Python client for the Hugging Face Hub.

244.7M 4K 1K
scikit-learn
scikit-learn

scikit-learn: machine learning in Python

207.1M 66K 27K
huggingface
transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

143.5M 160K 33K
huggingface
datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

118.9M 21K 3K
pytorch
torch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

84.8M 100K 28K
microsoft
onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

69.1M 20K 4K
nltk
nltk

NLTK Source

60.5M 15K 3K
modal-labs
modal

SDK libraries for Modal

53.1M 472 93
ray-project
ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

52.9M 42K 8K
apache
apache-airflow-providers-common-sql

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

50.3M 45K 17K
dmlc
xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

42.9M 28K 9K
mlflow
mlflow-skinny

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

38.3M 26K 6K
mlflow
mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

36.4M 26K 6K
pytorch
torchvision

Datasets, Transforms and Models specific to Computer Vision

35.6M 18K 7K
apache
apache-airflow-providers-fab

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

30.2M 45K 17K
streamlit
streamlit

Streamlit — A faster way to build and share data apps.

27.7M 44K 4K
wandb
wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

25M 11K 864
supabase
realtime

Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.

25M 3K 479
explosion
thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

24.8M 3K 294
apache
apache-airflow-providers-http

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

24M 45K 17K
tensorflow
tensorflow

An Open Source Machine Learning Framework for Everyone

22.6M 195K 75K
aws
sagemaker

A library for training and deploying machine learning models on Amazon SageMaker

22.5M 2K 1K
apache
apache-airflow-providers-common-compat

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

22.2M 45K 17K
explosion
spacy

💫 Industrial-strength Natural Language Processing (NLP) in Python

22.1M 34K 5K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery