91 dependents
| Description | Downloads/month |
|---|---|
| A high-throughput and memory-efficient inference and serving engine for LLMs | 9.4M |
| Turn any computer or edge device into a command center for your computer vision ... | 1.1M |
| Lepton AI Platform | 447K |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 143K |
| Turn any computer or edge device into a command center for your computer vision ... | 119K |
| Wheels & Docker images for running vLLM on CPU-only systems, optimized for diffe... | 31K |
| Open source alternative to LangGraph Platform (now LangSmith Deployments) - Self... | 29K |
| TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... | 16K |
| Agentic Repo Maintenance | 13K |
| A package that provides a set of tools to build a FastAPI application with a Cla... | 13K |
| Rammearkitektur integrations framework | 12K |
| OpenRAG is a comprehensive Retrieval-Augmented Generation platform that enables ... | 11K |
| Application server components for LlamaDeploy | 11K |
| A lightweight plugin framework for building extensible AI systems | 9K |
| Turn any computer or edge device into a command center for your computer vision ... | 9K |
| Large-scale LLM inference engine | 7K |
| Turn any computer or edge device into a command center for your computer vision ... | 7K |
| Package for fastAPI models | 6K |
| An AI Gateway, registry, and proxy that sits in front of any MCP, A2A, or REST/g... | 5K |
| Python SDK for RockAI.online | 5K |
| A UNIQUE story-downloading tool for Valvrareteam.net | 4K |
| Wheels & Docker images for running vLLM on CPU-only systems, optimized for diffe... | 3K |
| Application server components for LlamaDeploy | 3K |
| Adaptive Agentic AI Reasoning using Microsoft Agent Framework -- Join the Discor... | 3K |
| vLLM CPU inference engine (AVX512 + VNNI optimized) | 3K |
| Wheels & Docker images for running vLLM on CPU-only systems, optimized for diffe... | 3K |
| Matter's Observability Library - Includes all observability functions, including... | 3K |
| vLLM CPU inference engine (AVX512 optimized) | 2K |
| General Package for Microservices based on FastAPI like Profiler, Scheduler, Sys... | 2K |
| Shared tools for other services | 2K |
| Common utilities for Camptocamp ASGI applications | 2K |
| gULP - (generic) Unified Log Processor | 2K |
| Estimate Energy Consumption | 1K |
| Feature flags server | 931 |
| API dashboard service | 873 |
| Citation-verified RAG service with deterministic + semantic claim verification | 769 |
| Open-source High-performance authorization engine for RBAC, ReBAC, and ACL. Mult... | 761 |
| Generative AI components | 722 |
| General information, model certifications, and benchmarks for nm-vllm enterprise... | 666 |
| Expose Great Expectations data-quality checks via MCP | 646 |
| Package for IoC knowledge management | 645 |
| GRID - Geometric Resonance Intelligence Driver: A comprehensive framework for ex... | 635 |
| FastAPI server for Z3rno: REST API, authentication, rate limiting, and Celery wo... | 625 |
| Defense-in-depth input safety for LLMs — perplexity gate + FAISS + ModernBERT +... | 515 |
| vLLM Kunlun3 backend plugin | 464 |
| TextEmbed is a REST API crafted for high-throughput and low-latency embedding in... | 440 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 437 |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 375 |
| Python package with core python to use in microservices | 357 |
| Add your description here | 355 |