PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Multi Modal Python Packages

Python packages with the GitHub topic multi-modal. Sorted by relevance, with stars and monthly downloads.
modelscope
modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

4.1M 9K 934
agentscope-ai
agentscope

Build and run agents you can see, understand and trust.

205K 25K 3K
docarray
docarray

Represent, send, store and search multimodal data

144K 3K 241
awslabs
pyrhubarb

A Python framework for multi-modal document understanding with Amazon Bedrock

44K 103 14
MedMNIST
medmnist

[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification

44K 1K 207
valhalla
pyvalhalla

Open Source Routing Engine for OpenStreetMap

6K 6K 888
lucidrains
dalle-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

5K 6K 643
answerdotai
byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

5K 847 93
OFA-Sys
cn-clip

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

4K 6K 552
lucidrains
transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

4K 1K 71
valhalla
pyvalhalla-weekly

Open Source Routing Engine for OpenStreetMap

4K 6K 888
kyegomez
qwen

My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't released model code yet sooo...

4K 12 2
datajuicer
py-data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

4K 6K 368
zjunlp
deepke

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

3K 4K 742
BrainLesion
brainles-preprocessing

preprocessing tools for multi-modal 3D brain imaging

2K 31 8
avitai
avitai-artifex

A research-focused modular generative modeling library built on JAX/Flax NNX

2K 1 0
kyegomez
vision-llama

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

1K 16 0
kyegomez
switch-transformers

Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"

836 139 17
RasmussenLab
move-dl

MOVE (Multi-Omics Variational autoEncoder) for integrating multi-omics data and identifying cross modal associations

683 96 33
souradipp76
mm-poe

Multiple Choice Reasoning via. Process of Elimination using Multi-Modal Models

545 1 1
johndef64
mychatgpt

mychatgpt is a small and useful Python module that provides functions for interacting with OpenAI's GPT models to create conversational agents. This module allows users to have interactive conversations with the GPT models and keeps track of the conversation history in your Python Projects and Jupyter Notebooks.

448 5 0
kyegomez
tiny-gptv

Simple Implementation of TinyGPTV in super simple Zeta lego blocks

367 16 0
kyegomez
hsss

Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling"

366 15 2
kyegomez
kosmosx

Transformers at zeta scales

363 70 11
    • Data from PyPI, GitHub, ClickHouse, and BigQuery