PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
bespokelabsai
bespokelabs-curator

Synthetic data curation for post-training and structured data extraction

42K 2K 140
hiyouga
llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

29K 71K 9K
ContextualAI
gritlm

Generative Representational Instruction Tuning

18K 691 50
datajuicer
py-data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

4K 6K 368
snowmuffin
convmerge

Merge heterogeneous chat/text sources into a single LLM training format (JSONL)

2K 0 1
datadreamer-dev
datadreamer-dev

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

2K 1K 59
sileod
tasksource

Datasets collection and preprocessings framework for NLP extreme multitask learning

2K 195 11
hiyouga
llmtuner

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

2K 71K 9K
haotian-liu
llava-torch

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

815 25K 3K
vincentzed
decontaminate

`decon`, but with python API binding.

803 2 0
stef41
castwright

Generate high-quality synthetic instruction-tuning data from seed examples. Simple API, built-in quality filtering, cost-aware.

772 1 0
hiyouga
lazyllm-llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

394 71K 9K
Luodian
otter-ai

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

359 3K 212
zhuang-li
scar-tool

SCAR: An AI-powered tool for ranking and filtering instruction-answer pairs based on writing quality and style consistency

315 40 4
simplifine-llm
simplifine-alpha

An easy to use, open-source LLM finetuning library that handles all the complexities of the process for you.

281 96 4
mohammedaly22
vibeprompt

🦩VibePrompt. A lightweight Python package for adapting prompts by tone, style, and audience. Built on top of LangChain, VibePrompt supports multiple LLM providers and enables structured, customizable prompt transformations for developers, writers, and researchers.

278 9 0
hiyouga
llamafactory-songlab

Easy-to-use LLM fine-tuning framework

96 71K 9K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery