PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
reactor-no8
neots

NeoTextSynthesizer is a high-performance OCR training data generator.

20K 1 0
vkit-x
vkit-nightly

Boosting Document Intelligence

4K 23 1
Open-DataFlow
open-dataflow

Modern Data Centric AI system for Large Language Models

3K 3K 315
DIYer22
bpycv

Computer vision utils for Blender.

1K 501 60
Open-DataFlow
open-dataflow-adp

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

1K 3K 315
sebhaan
tabpfgen

TabPFGen: Synthetic Tabular Data Generation with TabPFN

441 40 6
EtienneChollet
oct-vesselseg

A Label-Free and Data-Free Synthesis Engine and Training Framework for Vascular Segmentation of sOCT Data with PyTorch.

333 6 0
ArenaGrenade
bpycv3d

Blender Python Package for extracting internal data from blender scenes for 3d related data generation purposes.

173 6 0
open-sciencelab
graphg

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

157 1K 82
vkit-x
vkit

In Pursuit Of The Best Synthetic Data Generation

67 23 1
MatthewCYM
gense

Official implementation of EMNLP 2022 paper "Generate, Discriminate, and Contrast: A Semi-Supervised Sentence Representation Learning Framework"

57 23 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery