237 dependents
| Description | Downloads/month | |
|---|---|---|
| Run LLMs with MLX | 1.5M | |
| MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VL... | 349K | |
| Examples in the MLX framework | 117K | |
| A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library ... | 78K | |
| MLX-Embeddings is the best package for running Vision and Language Embedding mod... | 51K | |
| MLX native implementations of state-of-the-art generative image models | 40K | |
| vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers M... | 36K | |
| A high-performance API server that provides OpenAI-compatible endpoints for MLX ... | 22K | |
| The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s ca... | 18K | |
| Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT,... | 14K | |
| An implementation of Nvidia's Parakeet models for Apple Silicon using MLX. | 11K | |
| vLLM-like inference for Apple Silicon - GPU-accelerated Text, Image, Video & Aud... | 10K | |
| Core API and plugin system for LEANN | 7K | |
| Probably Hardly Ever Works — search-based superoptimizer for MLX/Metal | 7K | |
| InstructLab Core package. Use this to chat with a model and execute the Instruc... | 7K | |
| port of mlx-video but attempting to get audio ported as well finally | 6K | |
| Lumerico's Comprehensive Interface for Deep Learning | 6K | |
| Mixed-precision quantization optimizer for MLX models on Apple Silicon | 6K | |
| Timeseries signal processing implementations in ezmsg | 5K | |
| Implementation of F5-TTS in MLX | 5K | |
| Reusable mid-level building blocks for MLX — causal convolutions, multi-dim RoPE... | 5K | |
| Moshi is moshi, but running on macOS | 5K | |
| Qwen3-ASR speech recognition on Apple Silicon via MLX | 5K | |
| Merlin — a fast local LLM for agentic coding on Apple Silicon | 5K | |
| Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based ... | 4K | |
| AlienSky optimized inference engine for Qwen on Apple Silicon | 4K | |
| Asynchronous Self-Healing KV Cache for Silicon-Native LLMs by GDI Nexus | 3K | |
| | 3K | |
| Lossless DFlash speculative decoding for MLX on Apple Silicon | 3K | |
| Extreme weight and KV cache compression for LLMs on Apple Silicon (MLX implement... | 3K | |
| Train LLMs on Apple silicon with MLX and the Hugging Face Hub | 3K | |
| Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon | 3K | |
| Z-Vision Generator — cross-platform AI image and video generator | 2K | |
| On-device Image Generation for Apple Silicon | 2K | |
| Qwen3 CustomVoice command-line tool | 2K | |
| RETIX - The Optic Nerve for Autonomous Agents | 2K | |
| Unified MLX server & CLI (language and vision) with OpenAI-compatible endpoints | 2K | |
| Extraordinary speed, extraordinary quality — an MLX-based inference engine for A... | 2K | |
| Local speech synthesis for Apple Silicon — TTS, voice cloning, dialogue, and sou... | 2K | |
| Python utility for text embeddings in MLX. | 2K | |
| HuggingFace model management for MLX on Apple Silicon | 2K | |
| CanViT inference on Apple Silicon via MLX | 2K | |
| LLM fine-tuning & RL framework for MLX. | 2K | |
| YOLO26 implementation using Apple MLX framework | 2K | |
| Run AI models too large for your Mac's memory — expert caching, speculative exec... | 1K | |
| MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by Dee... | 1K | |
| Python tools for text to speech (TTS), speech to text (STT), and speech to speec... | 1K | |
| Metal-accelerated Vision Mamba for Apple Silicon (2D/3D/4D) with 3.9x training s... | 1K | |
| 3 AI models. 161B parameters. One Mac. 5.5GB. Full agentic pipeline on Apple Sil... | 1K | |
| Metal Flash Attention for MLX | 1K |