12 dependents
Package Description Downloads/month
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers M... 37K
A high-performance API server that provides OpenAI-compatible endpoints for MLX ... 22K
Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT,... 14K
vLLM-like inference for Apple Silicon - GPU-accelerated Text, Image, Video & Aud... 11K
MLX Omni Server is a local inference server powered by Apple's MLX framework, sp... 3K
An optimized MLX (Apple Silicon Metal) Server for running local LLMs with higher... 997
Engram-inspired memory MCP server with hot cache and pattern mining 771
Train Embedding Models on MLX. 765
High-performance, local RAG search engine and MCP/API server for Apple Silicon 519
MLX-GUI MLX Inference Server for Apple Silicon 190
Add your description here 87
A Model Context Protocol (MCP) server for indexing and semantically searching PD... 70