PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics.
  • vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs (9.4M downloads, 79K stars, 16K forks)
  • vllm-project/vllm-tpu: A high-throughput and memory-efficient inference and serving engine for LLMs (143K downloads, 79K stars, 16K forks)
  • ThreeFish-AI/coding-proxy: A high-availability, transparent, and smart multi-vendor proxy for Claude Code. Supports Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao... (15K downloads, 15 stars, 1 fork)
  • Shelpuk-AI-Technology-Consulting/kitty-bridge: Universal LLM bridge for AI agents. Use Claude Code with MiniMax, Codex with GLM, or Gemini CLI with OpenRouter: one command, any provider. Works with coding agents, OpenClaw, Hermes, and others. (7K downloads, 8 stars, 2 forks)
  • Amanbig/devorch: A terminal-native, multi-provider intelligent assistant that plans, executes, and tracks developer tasks rather than just answering prompts, similar to Claude Code and Gemini CLI. (559 downloads, 4 stars, 0 forks)
  • vllm-project/vllm-hust: A high-throughput and memory-efficient inference and serving engine for LLMs (437 downloads, 79K stars, 16K forks)
  • vllm-project/wxy-test: A high-throughput and memory-efficient inference and serving engine for LLMs (375 downloads, 2K stars, 1K forks)
  • LLMPages/llm-onesdk: OneSDK is a Python library that provides a unified interface for interacting with various Large Language Model (LLM) providers. (363 downloads, 2 stars, 0 forks)
  • vllm-project/vllm-xft: A high-throughput and memory-efficient inference and serving engine for LLMs (345 downloads, 79K stars, 16K forks)
  • vllm-project/ai-dynamo-vllm: A high-throughput and memory-efficient inference and serving engine for LLMs (344 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-acc: A high-throughput and memory-efficient inference and serving engine for LLMs (342 downloads, 79K stars, 16K forks)
  • vllm-project/nextai-vllm: A high-throughput and memory-efficient inference and serving engine for LLMs (273 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-consul: A high-throughput and memory-efficient inference and serving engine for LLMs (219 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-npu: A high-throughput and memory-efficient inference and serving engine for LLMs (209 downloads, 79K stars, 16K forks)
  • SertraFurr/kimi4free: Simple API wrapper for Kimi (207 downloads, 4 stars, 1 fork)
  • vllm-project/vllm-musa: A high-throughput and memory-efficient inference and serving engine for LLMs (194 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-rocm: A high-throughput and memory-efficient inference and serving engine for LLMs (176 downloads, 79K stars, 16K forks)
  • shibing624/chatpilot: ChatPilot: a chat agent web UI implementing a chat front end, with support for Google search, file and URL conversations (RAG), and a code interpreter; reproduces Kimi Chat (drag in a file; paste in a URL). (134 downloads, 599 stars, 59 forks)
  • vllm-project/vllm-emissary: A high-throughput and memory-efficient inference and serving engine for LLMs (132 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-usf: A high-throughput and memory-efficient inference and serving engine for LLMs (115 downloads, 79K stars, 16K forks)
  • AirTouch666/aether-cli: A command-line interface for interacting with various AI models. (113 downloads, 0 stars, 0 forks)
  • vllm-project/tilearn-infer: A high-throughput and memory-efficient inference and serving engine for LLMs (107 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-online: A high-throughput and memory-efficient inference and serving engine for LLMs (82 downloads, 79K stars, 16K forks)
  • vllm-project/vllm-test-tpu: A high-throughput and memory-efficient inference and serving engine for LLMs (80 downloads, 79K stars, 16K forks)
    • Data from PyPI, GitHub, ClickHouse, and BigQuery