135 dependents
| Package description | Downloads/month |
|---|---|
| MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VL... | 349K |
| A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library ... | 78K |
| An open experiment: does developer sentiment with Claude Code vary by time of da... | 72K |
| vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers M... | 36K |
| A high-performance API server that provides OpenAI-compatible endpoints for MLX ... | 22K |
| The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s ca... | 18K |
| Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT,... | 14K |
| Optimizing inference proxy for LLMs | 12K |
| vLLM-like inference for Apple Silicon - GPU-accelerated Text, Image, Video & Aud... | 10K |
| Core API and plugin system for LEANN | 7K |
| Mixed-precision quantization optimizer for MLX models on Apple Silicon | 6K |
| Persistent conversational memory for AI coding assistants | 5K |
| AlienSky optimized inference engine for Qwen on Apple Silicon | 4K |
| Asynchronous Self-Healing KV Cache for Silicon-Native LLMs by GDI Nexus | 3K |
| MLX Omni Server is a local inference server powered by Apple's MLX framework, sp... | 3K |
| This package implements all the logic of Brief My Press.AI | 3K |
| Lossless DFlash speculative decoding for MLX on Apple Silicon | 3K |
| Extreme weight and KV cache compression for LLMs on Apple Silicon (MLX implement... | 3K |
| Train LLMs on Apple silicon with MLX and the Hugging Face Hub | 3K |
| Coding Agent for Mac | 3K |
| Z-Vision Generator — cross-platform AI image and video generator | 2K |
| Qwen3 CustomVoice command-line tool | 2K |
| An LLM interface library. | 2K |
| Unified MLX server & CLI (language and vision) with OpenAI-compatible endpoints | 2K |
| Extraordinary speed, extraordinary quality — an MLX-based inference engine for A... | 2K |
| HuggingFace model management for MLX on Apple Silicon | 2K |
| Run AI models too large for your Mac's memory — expert caching, speculative exec... | 1K |
| Standalone MLX-based LLM inference service with OpenAI-compatible API | 1K |
| Python tools for text to speech (TTS), speech to text (STT), and speech to speec... | 1K |
| Run local LLMs from Python. LangChain-compatible. llama.cpp + MLX backends. | 1K |
| 3 AI models. 161B parameters. One Mac. 5.5GB. Full agentic pipeline on Apple Sil... | 1K |
| An optimized MLX (Apple Silicon Metal) server for running local LLMs with higher... | 1K |
| Offline meeting recorder & summarizer for macOS | 1K |
| Support for MLX models in LLM | 954 |
| dora-qwen | 902 |
| GPU-Accelerated LLM Terminal for Apple Silicon | 886 |
| Mechanistic interpretability on Apple Silicon: steering vectors, residual captur... | 862 |
| Pure MLX port of Baidu ERNIE-Image (8B text-to-image DiT) for Apple Silicon infe... | 841 |
| GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered st... | 830 |
| ASIMOV MLX Module | 787 |
| VAD-driven streaming voice dictation for macOS — local Whisper ASR + Silero VAD ... | 683 |
| Train embedding models on MLX. | 678 |
| vLLM hardware plugin for Apple Silicon - unifies MLX and PyTorch under a single ... | 638 |
| A Retrieval-Augmented Generation (RAG) chat interface with support for multiple ... | 624 |
| Hanzo Network - Distributed AI compute network for running models locally and re... | 621 |
| MLX integration for the Agent Framework | 565 |
| LittleHive local-first multi-agent assistant foundation | 552 |
| For inferring and serving local LLMs using the MLX framework | 543 |
| A comprehensive toolkit for end-to-end continued pre-training, fine-tuning, moni... | 530 |
| llama-index llms mlx integration | 515 |