PyPI Stats

Multimodal Deep Learning Python Packages

Python packages with the GitHub topic multimodal-deep-learning. Sorted by relevance, with stars and monthly downloads.
jrzaurin
pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

3K 1K 197
theislab
scarches

Reference mapping for single-cell genomics

3K 404 70
thuiar
mmsa-fet

A Tool for extracting multimodal features from videos.

1K 212 27
AI4Finance-Foundation
finrobot

FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀

1K 7K 1K
kyegomez
bitnet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

1K 2K 173
automatika-robotics
roboml

Machine learning models optimized for robotics experimentation and deployment

1K 11 0
kyegomez
pegasusx

PegasusX: The Future of Multimodal Embeddings 🦄 🦄

892 14 5
kyegomez
navit-torch

navit - Pytorch

876 273 15
kyegomez
swarms-torch

swarms-torch - Pytorch

857 139 14
kyegomez
medpalm

MedPalm - Pytorch

677 432 58
kyegomez
pali-torch

Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"

627 95 8
idso-fa1-pathology
vitaminp

VitaminP: a vision transformer-assisted multimodal integration network for pathology cell segmentation

473 8 1
georgepar
slp

Utils and modules for Speech Language and Multimodal processing using pytorch and pytorch lightning

422 22 6
jaisidhsingh
loraclip

A simple and efficient wrapper for LoRA-ifying CLIP.

347 40 2
kyegomez
the-compiler

Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!

318 145 17
kyegomez
pali3

pali3 - Pytorch

282 146 4
kelechi-c
ripple-net

Text-image search and image tagging library

277 13 1
pytorch-duo
torchmm

PyTorch DataLoader and Abstraction for multi-modal data.

238 0 0
multimind-dev
multimind-sdk

Your SDK solves all of this. One interface. Unified logic. Local + hosted models. Fine-tuning. Agent tools. Enterprise-ready. Hybrid RAG. Star 🌟 if you like it!

216 93 14
kyegomez
cross-attn

The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"

186 37 1
asnelt
mmae

Package for Multimodal Autoencoders in TensorFlow / Keras

178 20 12
kyegomez
kosmos2-torch

Kosmos - Pytorch

164 74 6
kyegomez
vodin

SOTA Classification at scale for UAVs, Drones, and much more

139 5 0
kyegomez
mmqqa

Experiments around using Multi-Modal Causal Attention with Multi-Grouped Query Attention

118 5 0
Data from PyPI, GitHub, ClickHouse, and BigQuery