PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
chatnoir-eu
fastwarc

A robust web archive analytics toolkit

1.3M 137 18
chatnoir-eu
resiliparse

A robust web archive analytics toolkit

1.2M 137 18
scikit-hep
uproot

ROOT I/O in pure Python and NumPy.

1M 264 94
joseph-fox
pybloom-live

Scalable Bloom Filter implemented in Python

486K 165 25
scikit-hep
uproot3

ROOT I/O in pure Python and NumPy.

330K 313 66
canimus
cuallee

Possibly the fastest DataFrame-agnostic quality check library in town.

106K 243 22
abeusher
timehash

An algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.

19K 45 14
legend-exp
legend-pydataobj

LEGEND Python Data Objects

13K 1 13
RayforceDB
rayforce-py

High-performance and powerful Python DataFrame library built on top of RayforceDB

9K 14 0
scikit-hep
uproot4

ROOT I/O in pure Python and NumPy.

8K 264 94
ironmussa
optimuspyspark

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

6K 2K 232
abronte
pysparkgateway

Connect to remote Spark clusters seamlessly.

4K 3 4
visualpython
visualpython

Visual Python is a GUI-based Python code generator, developed on the Jupyter Notebook as an extension.

3K 918 119
dbcli
athenacli

AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.

3K 225 36
databendlabs
databend

Data Agent Ready Warehouse: One for Analytics, Search, AI, Python Sandbox. Rebuilt from scratch. Unified architecture on your S3.

2K 9K 870
bigbio
quantms-utils

A Python library with scripts and helper classes for the quantms workflow

2K 6 5
apache
airavata-django-portal-sdk

Apache Airavata Django Portal SDK

2K 0 2
visualpython
jupyterlab-visualpython

GUI-based Python code generator for data science, extension to Jupyter Lab, Jupyter Notebook and Google Colab.

2K 918 119
bigartm
bigartm

Fast topic modeling platform

1K 673 121
hi-primus
pyoptimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

1K 2K 232
unum-cloud
ukv

Python bindings for Unum's UStore.

1K 631 35
BROADSoftware
hadeploy

A Hadoop application deployment tool

869 9 4
gangly
datafaker

Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts it into varied data sources. A test data generation tool.

859 643 166
arvados
arvados-tools

A single package to install all Arvados client tools

799 417 126
Data from PyPI, GitHub, ClickHouse, and BigQuery