16 dependents
Package Description Downloads/month
SoccerNet SDK 18K
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio T... 16K
Evaluation and adaption method for the UNICORN Challenge 2K
[CVPR2024 Highlight] VBench - We Evaluate Video Generation 1K
Evaluating Text-to-Visual Generation with Image-to-Text Generation. 704
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of Deep... 359
[CVPR2024 Highlight] VBench - We Evaluate Video Generation 344
LAVIS - A One-stop Library for Language-Vision Intelligence 309
LAVIS - A One-stop Library for Language-Vision Intelligence 229
[CVPR24] Polos: Multimodal Metric Learning from Human Feedback for Image Caption... 196
Slimmed release mirror of UniTrust for AEN and TruthPrInt. 192
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities 156
131
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More! 119
Video quality evaluation toolkit with comprehensive metrics 87
All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 202... 80