49 dependents
| Description | Downloads/month |
|---|---|
| SGLang is a high-performance serving framework for large language models and mul... | 287.7M |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 9.4M |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 143K |
| A toolset for compressing, deploying and serving LLM | 123K |
| A simple and powerful tool to get things done with AI | 79K |
| Wheels & Docker images for running vLLM on CPU-only systems, optimized for diffe... | 31K |
| TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... | 16K |
| MindRoot AI Agent Framework | 14K |
| Accelerate, Optimize performance with streamlined training and serving options w... | 9K |
| Large-scale LLM inference engine | 7K |
| The official zero-trust, high-throughput kinetic execution engine for the coreas... | 4K |
| SGLang is a high-performance serving framework for large language models and mul... | 4K |
| Python component of using Briton | 4K |
| Wheels & Docker images for running vLLM on CPU-only systems, optimized for diffe... | 3K |
| Offline voice agent framework for robots. | 3K |
| Open-source framework for building AI-powered apps in JavaScript, Go, and Python... | 3K |
| fastllm is a high-performance large-model inference library with no backend dependencies. It supports tensor-parallel inference of dense models and mixed-mode inference of MoE models; any GPU with 10 GB+ of memory can run full DeepSeek. Dual-socket 900... | 3K |
| vLLM CPU inference engine (AVX512 + VNNI optimized) | 3K |
| Wheels & Docker images for running vLLM on CPU-only systems, optimized for diffe... | 3K |
| vLLM CPU inference engine (AVX512 optimized) | 2K |
| FuriosaAI SDK | 2K |
| INF Tech's open-source MLLMs for SOTA visual-language understanding and advanced... | 1K |
| A complete terminal implementation of Anthropic's Claude. | 1K |
| Useful utilities for prompt engineering | 1K |
| A toolset for compressing, deploying and serving LLM | 887 |
| Siada CLI is an AI pair programming tool in the terminal | 689 |
| General Information, model certifications, and benchmarks for nm-vllm enterprise... | 666 |
| vLLM Kunlun3 backend plugin | 464 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 437 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 375 |
| | 353 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 344 |
| JAX backend for SGL | 295 |
| Modular Multimodal Intelligent Reformatting and Augmentation Generation Engine -... | 260 |
| A tool for LLM agent conversations | 219 |
| SGLang is yet another fast serving framework for large language models and visio... | 209 |
| SkillEngine — framework-agnostic skills engine for LLM agents. Claude Code-like ... | 192 |
| A minimal wrapper for the Google Gemini (google-genai) API | 189 |
| SGLang fork for ppc64le with CUDA 12.4 and Torch Triton support | 186 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 176 |
| Convert infrastructure scans into various output formats such as Markdown tables... | 151 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 132 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 115 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 80 |
| Inferencing and Training Large Language Model Tasks | 73 |
| Genkit AI Framework | 70 |
| An agent framework using LLMs | 56 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 42 |
| SGLang is a fast serving framework for large language models and vision language... | 2 |