PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
OpenDCAI
dataflex

DataFlex is a data-centric training framework that enhances model performance by selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

566 527 58
georgianpartners
transformers-domain-adaptation

[DEPRECATED] Adapt Transformer-based language models to new text domains

518 85 13
zhuang-li
scar-tool

SCAR: An AI-powered tool for ranking and filtering instruction-answer pairs based on writing quality and style consistency

315 40 4
p-lambda
data-selection

Data Selection with Importance Resampling

255 273 19
4AI
gen-dedup

Code for Generative Deduplication For Social Media Data Selection (Findings of EMNLP 2024)

119 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery