PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
WenjieDu
pygrinder

PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at random), MAR (at random), MNAR (not at random), sub sequence missing, and block missing

122K 65 6
Baukebrenninkmeijer
table-evaluator

Evaluate real and synthetic datasets against each other

6K 92 28
DLR-RM
blenderproc

A procedural Blender pipeline for photorealistic training image generation

5K 4K 508
kontextox
datasety

CLI tool for dataset preparation: resize, align, caption, shuffle, synthetic, and mask generation.

4K 2 0
Belval
trdg

A synthetic data generator for text recognition

3K 4K 1K
rasinmuhammed
misata

Python synthetic data generator for realistic multi-table test data, database seeding, and scenario simulation

2K 54 3
MattyB95
jabberjay

🦜 Synthetic Voice Detection

2K 9 1
ZumoLabs
zpy-zumo

Synthetic data for computer vision. An open source toolkit using Blender and Python.

2K 321 35
instana
synctl

CLI Tool for Synthetic Monitoring to Manage Synthetic Test and Locations Easily

1K 7 1
clovaai
synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

1K 575 109
meta-llama
synthetic-data-kit

Tool for generating high quality Synthetic datasets

1K 2K 218
OllieBoyne
blendersynth

Synthetic Blender Dataset Production

1K 95 10
eqasim-org
synpp

Synthetic population pipeline package for eqasim

1K 21 16
nhsengland
nhssynth

Package to accompany P41

789 5 3
tellae
bhepop2

Synthetic population enrichment from aggregated data

765 2 1
alfurka
synloc

A Python package to create synthetic data from locally estimated distributions

689 3 0
OmarSamirz
iftg

IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layouts. IFTG supports all languages and offers endless noise combinations, including custom noise creation.

666 21 2
NLR-Distribution-Suite
nrel-shift

Python package for developing power distribution model using opensource data.

348 6 1
WenjieDu
pycorruptor

PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at random), MAR (at random), MNAR (not at random), sub sequence missing, and block missing

268 65 6
finos
datahub-core

DataHub - Synthetic data library

175 80 11
DocsaidLab
wordcanvas-docsaid

Generating text with custom fonts and styles.

121 0 0
AmadeusITGroup
synthetic-face-masks

A Python library for generating synthetic face mask datasets by mixing facial regions

108 0 0
finos
datahub-core-grovesy

Synthetic data generation tools for financial markets

98 80 11
dynatrace-oss
db-load-generator

Mock database activity and run scalable simulations of database load with as little code as necessary

5 6 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery