PyPI Stats

Data Versioning Python Packages

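Python packages whose GitHub repository carries the topic data-versioning, sorted by relevance. Each entry gives the repository owner, the package name, and a description, followed by a row of figures that starts with monthly downloads and GitHub stars.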
wandb
wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

25.2M 11K 864
treeverse
lakefs-sdk

lakeFS - Data version control for your data lake | Git for data

1.1M 5K 446
treeverse
lakefs

lakeFS - Data version control for your data lake | Git for data

922K 5K 446
treeverse
lakefs-client

lakeFS - Data version control for your data lake | Git for data

196K 5K 446
laminlabs
lamindb

Open-source data framework for biology. Context and memory for datasets and models at scale. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. 🍊YC S22

101K 260 24
quiltdata
quilt3

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

31K 1K 90
laminlabs
lamindb-core

Open-source data framework for biology. Context and memory for datasets and models at scale. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. 🍊YC S22

21K 260 24
layerai
layer

Metadata store for Production ML

17K 88 6
BemiHQ
bemi-sqlalchemy

Automatic data change tracking for SQLAlchemy

4K 6 0
quiltdata
quilt

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

4K 1K 90
data-as-code
dac

Python Data as Code core implementation

2K 12 1
wandb
wandb-ng

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

1K 11K 864
BemiHQ
bemi-django

Automatic data change tracking for Django

1K 6 0
eliask
farchive

Local content-addressed archive with observation history. Stores bytes by SHA-256, preserves locator state as contiguous spans, compresses with zstd and corpus-trained dictionaries. SQLite-backed.

915 6 0
wandb
wandb-testing

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

287 11K 864
MaratSaidov
artificial-detection

Python framework for artificial text detection with NLP approaches.

279 16 1
treeverse
lakefs-sdk-async

lakeFS - Data version control for your data lake | Git for data

266 5K 446
quiltdata
quilt-installer

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

177 1K 90
NewronAI
newron

Newron is a data-centric ML platform to easily build, manage, deploy and continuously improve models through data driven development.

170 3 4
NewronAI
newron-sdk

Newron is a data-centric ML platform to easily build, manage, deploy and continuously improve models through data driven development.

136 3 4
quiltdata
quilt-stack-installer

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

117 1K 90
wandb
tendb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

58 11K 864
wandb
custom-wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

39 11K 864
wandb
wandb-zc

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

37 11K 864
Data from PyPI, GitHub, ClickHouse, and BigQuery.