61 dependents
| Package description | Downloads/month | |
|---|---|---|
| Python SDK to configure and run evaluations for your LLM-based application | 19K | |
| The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecti... | 9K | |
| Python library for Evaluation | 6K | |
| Multi-agent codebase evaluation and reliability optimization. | 6K | |
| A multi-backend evaluation framework for LLM, RAG, and agentic systems. | 4K | |
| RAG evaluation system using Ragas with Phoenix/Langfuse tracing | 4K | |
| Differentiated in-house educational content and practice-oriented training | 4K | |
| Enterprise RAG pipelines with native IRIS vector search. 6 production implementa... | 3K | |
| Additional packages (components, document stores and the likes) to extend the ca... | 3K | |
| Pype AI's Agensight is an open-source experimentation studio built for conversat... | 3K | |
| A Python project for FloTorch | 2K | |
| The testing platform for AI teams. Bring engineers, PMs, and domain experts toge... | 2K | |
| TrustyAI's RAGAS provider for Llama Stack | 2K | |
| RagBuilder SDK - Create optimal Production-ready RAG pipelines | 1K | |
| Aerospace related chatbots | 1K | |
| Murnitur empowers AI teams to seamlessly test, evaluate, deploy, monitor, and sa... | 1K | |
| Next-generation knowledge base engine with advanced RAG capabilities | 978 | |
| Access to GPT-3 and document processing from the command line. | 860 | |
| | 834 | |
| A comprehensive evaluation framework for AI systems | 803 | |
| TigerGraphX is a high-level Python library offering a unified, Python-native int... | 791 | |
| This repo contains Python code that uses LangChain and more to build an AWS b... | 759 | |
| LangEvals Ragas evaluator | 750 | |
| RAG Evaluation Framework using Ragas metrics and MLflow tracking | 698 | |
| Supercharge Your LLM Application Evaluations 🚀 | 608 | |
| Helper modules for AI Engineering RAG Bootcamp reference implementations | 577 | |
| API Testing Framework to automate and simplify API testing using LLM Agents and ... | 570 | |
| 🎧 DJ's Production RAG Pipeline - PDFs → Pinecone → LLM → RAGAS (Sub-10s E2E) | 509 | |
| RAG Benchmarking — Framework-agnostic RAG/agentic-AI evaluation harness. Faithfu... | 503 | |
| Med-Discover is an AI-powered tool designed to assist biomedical researchers by ... | 464 | |
| A framework for evaluating, monitoring, and benchmarking multi-agent systems | 459 | |
| A simple and efficient RAG (Retrieval-Augmented Generation) library with Knowled... | 448 | |
| An application that performs qualitative thematic analysis using LLMs | 387 | |
| Template for AI chatbots & document management using Retrieval-Augmented Generat... | 379 | |
| Project GenEval: A Unified Evaluation Framework for Generative AI Applications | 376 | |
| RAGAS-based evaluation pipeline with trust-specific metrics for TrustRAG | 356 | |
| | 356 | |
| 🌱 Pilot of on-premise RAG system | 303 | |
| Lightweight RAG evaluation framework for Korean language | 267 | |
| RAGAS integration adapter for Metrics Computation Engine | 192 | |
| my nlpswift | 185 | |
| Uploads results from ragas to Tonic Validate. | 177 | |
| parrot evaluation | 153 | |
| A fast, modular reimplementation of RAGAS's FactualCorrectness metric, supportin... | 146 | |
| A microservice using FastAPI, PostgreSQL, OpenSearch, and LangChain. | 132 | |
| synthetic data tooling for LLM training and evaluation | 132 | |
| The Production-Ready Open Source RAG for Education | 123 | |
| A Python library dedicated to PDF parsing, extracted from the DeepDoc module of the RagFlow project. It provides powerful PDF document parsing capabilities, supporting OC... | 114 | |
| | 114 | |
| Pipeline-agnostic evaluation and observability for knowledge graph, RAG, and KOS... | 105 |