PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
modelscope
modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

4M 9K 934
agentscope-ai
agentscope

Build and run agents you can see, understand and trust.

202K 25K 3K
docarray
docarray

Represent, send, store and search multimodal data

144K 3K 241
awslabs
pyrhubarb

A Python framework for multi-modal document understanding with Amazon Bedrock

44K 103 14
MedMNIST
medmnist

[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification

39K 1K 207
valhalla
pyvalhalla

Open Source Routing Engine for OpenStreetMap

6K 6K 888
lucidrains
dalle-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

6K 6K 643
answerdotai
byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

5K 847 93
OFA-Sys
cn-clip

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

4K 6K 552
valhalla
pyvalhalla-weekly

Open Source Routing Engine for OpenStreetMap

4K 6K 888
kyegomez
qwen

My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't released model code yet sooo...

4K 12 2
datajuicer
py-data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

4K 6K 368
lucidrains
transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

4K 1K 71
zjunlp
deepke

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

3K 4K 742
BrainLesion
brainles-preprocessing

preprocessing tools for multi-modal 3D brain imaging

2K 31 8
avitai
avitai-artifex

A research-focused modular generative modeling library built on JAX/Flax NNX

1K 1 0
kyegomez
vision-llama

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

889 16 0
kyegomez
switch-transformers

Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"

755 139 17
RasmussenLab
move-dl

Multi-omics variational autoencoder

655 94 33
souradipp76
mm-poe

Multiple Choice Reasoning via. Process of Elimination using Multi-Modal Models

512 1 1
kyegomez
hsss

Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling"

385 15 2
kyegomez
kosmosx

Transformers at zeta scales

369 70 11
kyegomez
rt2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

333 566 68
kyegomez
mm1-torch

MM1 - Pytorch

325 27 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery