PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Data Analytics Python Packages

Python packages with the GitHub topic data-analytics. Sorted by relevance, with stars and monthly downloads.
snowflakedb
snowflake-snowpark-python

Snowflake Snowpark Python API

52.9M 333 148
aiguofer
gspread-pandas

A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.

397K 408 55
mabel-dev
opteryx

🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.

73K 112 14
dbt-labs
dbt-mcp

A MCP (Model Context Protocol) server for interacting with dbt.

72K 557 114
llnl
llnl-hatchet

Graph-indexed Pandas DataFrames for analyzing hierarchical performance data

55K 35 19
girder
girder-worker

Distributed task execution engine with Girder integration, developed by Kitware

28K 35 32
feldera
feldera

The Feldera Incremental Computation Engine

19K 2K 115
tirthajyoti
mlr

Multiple linear regression with statistical inference, residual analysis, direct CSV loading, and other features

17K 34 11
pathwaycom
pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

15K 63K 2K
mabel-dev
opteryx-core

🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.

13K 112 14
girder
girder-import-tracker

A data management platform for the web, developed by Kitware

10K 453 177
benrutter
wimsey

Easy and flexible data contracts

9K 170 2
BCG-X-Official
gamma-facet

Human-explainable AI.

7K 532 46
ralfbecher
orionbelt-semantic-layer-mcp

MCP server for the OrionBelt Semantic Layer — enables LLMs to explore semantic models, compile queries, and execute analytics via natural language.

6K 2 1
apache
apache-superset-core

Apache Superset is a Data Visualization and Data Exploration Platform

6K 73K 17K
denisecase
datafun-toolkit

Privacy-safe diagnostics, paths, and logging helpers for analytics projects.

5K 1 0
Canner
wren-core-py

The open context engine for AI agents support 15+ data sources. Built on Rust and Apache DataFusion.

4K 661 197
hatchet
hatchet

Analyze graph/hierarchical performance data using pandas dataframes

4K 119 41
desbordante
desbordante

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

3K 477 99
Canner
wren-engine

The open context engine for AI agents support 15+ data sources. Built on Rust and Apache DataFusion.

3K 661 197
xoolive
traffic

A toolbox for processing and analysing air traffic data

3K 482 95
Zen-Reportz
zen-dash

Simple, Fast, Scalable , production grade dashboard application . Right solution for team

3K 14 3
helmholtz-analytics
heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

2K 238 65
Squarespace
datasheets

Read data from, write data to, and modify the formatting of Google Sheets

2K 625 55
    • Data from PyPI, GitHub, ClickHouse, and BigQuery