16 dependents
| Package | Description | Downloads/month |
|---|---|---|
| | SoccerNet SDK | 18K |
| | One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio T... | 16K |
| | Evaluation and adaption method for the UNICORN Challenge | 2K |
| | [CVPR2024 Highlight] VBench - We Evaluate Video Generation | 1K |
| | Evaluating Text-to-Visual Generation with Image-to-Text Generation. | 704 |
| | 🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of Deep... | 359 |
| | [CVPR2024 Highlight] VBench - We Evaluate Video Generation | 344 |
| | LAVIS - A One-stop Library for Language-Vision Intelligence | 309 |
| | LAVIS - A One-stop Library for Language-Vision Intelligence | 229 |
| | [CVPR24] Polos: Multimodal Metric Learning from Human Feedback for Image Caption... | 196 |
| | Slimmed release mirror of UniTrust for AEN and TruthPrInt. | 192 |
| | Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities | 156 |
| | | 131 |
| | State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More! | 119 |
| | Video quality evaluation toolkit with comprehensive metrics | 87 |
| | All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 202... | 80 |