Dependents of decord

70 dependents

Package	Description	Downloads/month
sam3	The repository provides code for running inference and finetuning with the Meta ...	44K
endoreg-db	endoreg-db	25K
naeural-core	These modules form the backbone "bare-metal" version of the Naeural Edge Protoco...	19K
opensportslib	OpenSportsLib is the professional library, designed for advanced video understan...	7K
otx	Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™	5K
vlm-engine	Advanced Vision-Language Model Engine for content tagging	5K
ayase	Modular media quality metrics toolkit.	3K
minicpmo-utils	Unified utilities package for MiniCPM-o: includes stepaudio2 and extensible util...	2K
paddleformers	PaddleFormers is an easy-to-use library of pre-trained large language model zoo ...	2K
dataflow-421		2K
pytorch-image-translation-models	A PyTorch library for multi-modal image translation with diffusion bridges, GANs...	2K
terminalvideoplayer	A TUI video player.	2K
nemo-export-deploy	NeMo Export and Deploy - a library to export and deploy LLMs and MMs	1K
vbench-pruna	[CVPR2024 Highlight] VBench - We Evaluate Video Generation	1K
videoseal	VideoSeal: Video watermarking library by Facebook AI Research.	1K
psifx	Psychological and Social Interactions Feature Extraction	1K
feral	FERAL: Feature Extraction for Recognition of Animal Locomotion	1K
cosmos-predict2	Cosmos-Predict2 is a collection of general-purpose world foundation models for P...	1K
pegasusx	PegasusX: The Future of Multimodal Embeddings 🦄 🦄	909
soccernetpro	SoccerNetPro is the professional extension of the popular SoccerNet library, des...	847
aana	Multimodal SDK	809
annotation-tool	Tool for annotating time series data from sources such as IMUs, MoCap, Videos an...	798
ppvideo	Awesome video understanding toolkits based on PaddlePaddle. It supports video da...	772
dataflow-421-dev		759
filter-sam3-detector		716
t2v-metrics	Evaluating Text-to-Visual Generation with Image-to-Text Generation.	704
controlnet-dwpose	DWPose component from ControlNeXt for whole-body pose estimation	497
scenedataset	PyTorch dataset which uses PySceneDetect to split videos into scenes	454
moviechat	[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understa...	373
ztrack		345
vbench2	[CVPR2024 Highlight] VBench - We Evaluate Video Generation	344
unipercept-reward	[ICML26 Spotlight] UniPercept: Towards Unified Perceptual-Level Image Understand...	334
vlm2vec-for-pyserini	This repo is a fork of the original VLM2Vec repo, modified for easy Pyserini int...	326
ammico-lavis	LAVIS - A One-stop Library for Language-Vision Intelligence	309
nvidia-vlmeval	OpenCompass VLM Evaluation Kit - packaged by NVIDIA	286
giga-datasets	GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation,...	274
vjepa	Modified from the official PyTorch codebase for the video joint-embedding predi...	272
datanavigator	Interactive data visualization for signals, plots, and videos.	272
lavis-gml	LAVIS - A One-stop Library for Language-Vision Intelligence	229
vjepa-encoder	JEPA research code.	220
livecc-utils	LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 20...	208
erniekit	The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade develo...	202
minicpmo	Unified utilities package for MiniCPM-o: includes cosyvoice + stepaudio2 and ext...	189
video-clip	AskVideos-VideoCLIP model	188
finetrainers	Finetrainers is a work-in-progress library to support (accessible) training of d...	175
starforce		149
fastdeduplicator	A CLI tool designed to find and handle duplicate files	127
lightrft	Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Fra...	120
perception-models-perone	State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!	119
aigve	A Video Quality Analysis Toolkit	118