Dependents of av - PyPI Stats

449 dependents

Package	Description	Downloads/month
sglang	SGLang is a high-performance serving framework for large language models and mul...	298.9M
faster-whisper	Faster Whisper transcription with CTranslate2	7.6M
aiortc	WebRTC and ORTC implementation for Python using asyncio	3.4M
livekit-agents	A framework for building realtime voice AI agents 🤖🎙️📹	3.4M
qwen-vl-utils	Qwen3-VL is the multimodal large language model series developed by Qwen team, A...	1.9M
inference-gpu	Turn any computer or edge device into a command center for your computer vision ...	1.1M
qwen-omni-utils	Qwen3-VL is the multimodal large language model series developed by Qwen team, A...	528K
vllm-omni	A framework for efficient model inference with omni-modality models	476K
manim	A community-maintained Python framework for creating mathematical animations.	292K
lerobot	🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning	204K
gllm-inference-binary	A library containing components related to model inferences in Gen AI applicatio...	167K
audiocraft	Audiocraft is a library for audio processing and generation with deep learning. ...	122K
inference	Turn any computer or edge device into a command center for your computer vision ...	116K
bithuman	Real-time avatar engine — 100+ FPS on CPU. Generate lip-synced video, stream liv...	108K
pyrit	The Python Risk Identification Tool for LLMs (PyRIT) is a library used to assess...	83K
audiolab	A streaming audio reader, processor, and writer built on top of soundfile, and P...	67K
pytgcalls	Voice chats, private incoming and outgoing calls in Telegram for Developers	64K
crewai-files	File handling utilities for CrewAI multimodal inputs	53K
indent	Indent is an AI Pair Programmer	51K
uiprotect	Python API for UniFi Protect (Unofficial)	46K
rvc-python	Using RVC via console or python scripts	38K
nerfstudio	All-in-one repository for state-of-the-art NeRFs	34K
llamafactory	Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)	29K
mlx-openai-server	A high-performance API server that provides OpenAI-compatible endpoints for MLX ...	22K
vision-agent	This tool has been deprecated. Use Agentic Document Extraction instead.	21K
py-feat	Facial Expression Analysis Toolbox	20K
physical-ai-av	Developer kit for working with the NVIDIA Physical AI Autonomous Vehicles Datase...	18K
osc-data	data represent, processing	17K
lmms-eval	One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio T...	17K
sendspin	Synchronized audio player for Sendspin servers	16K
jj-pytorchvideo	A deep learning library for video understanding research.	13K
jiminy-py	Fast and light weight simulator of rigid poly-articulated systems.	13K
picamera2	New libcamera based python library	13K
simli-ai	Add your description here	12K
av2	Argoverse 2: Next generation datasets for self-driving perception and forecastin...	11K
warp-beacon	Telegram bot for expanding external media links	11K
musicdl	Musicdl: A lightweight music downloader written in pure python. (轻量级无损音乐下载器，支持数十...	10K
avdeepfake1m	[ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dat...	10K
pixeltable	Data Infrastructure providing a declarative, incremental approach for multimodal...	10K
autowsgr	战舰少女R全家桶	9K
inference-core	Turn any computer or edge device into a command center for your computer vision ...	9K
zebrazoom	ZebraZoom can be used on fixed background videos to track the heads and tails of...	8K
albums	Command line tool to help manage a library of music albums	7K
vsaiortc	WebRTC and ORTC implementation for Python using asyncio	7K
molmo-utils	A set of helper functions for processing and integrating visual inputs with Molm...	7K
opensportslib	OpenSportsLib is the professional library, designed for advanced video understan...	7K
mm-ctx	Fast, multimodal context for agents.	7K
inference-cpu	Turn any computer or edge device into a command center for your computer vision ...	7K
videosdk	VideoSDK Python SDK	7K
super-slurpy	Super SLURPy: Python version of Speech and Language Ultrasound Research Package	7K