61 dependents
Package Description Downloads/month
Python SDK to configure and run evaluations for your LLM-based application 19K
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecti... 9K
Python library for Evaluation 6K
Multi-agent codebase evaluation and reliability optimization. 6K
A multi-backend evaluation framework for LLM, RAG, and agentic systems. 4K
RAG evaluation system using Ragas with Phoenix/Langfuse tracing 4K
Differentiated in-house educational content and practice-oriented training 4K
Enterprise RAG pipelines with native IRIS vector search. 6 production implementa... 3K
Additional packages (components, document stores and the likes) to extend the ca... 3K
Pype AI's Agensight is an open-source experimentation studio built for conversat... 3K
A Python project for FloTorch 2K
The testing platform for AI teams. Bring engineers, PMs, and domain experts toge... 2K
TrustyAI's RAGAS provider for Llama Stack 2K
RagBuilder SDK - Create optimal Production-ready RAG pipelines 1K
Aerospace related chatbots 1K
Murnitur empowers AI teams to seamlessly test, evaluate, deploy, monitor, and sa... 1K
Next-generation knowledge base engine with advanced RAG capabilities 978
Access to GPT-3 and document processing from the command line. 860
834
A comprehensive evaluation framework for AI systems 803
TigerGraphX is a high-level Python library offering a unified, Python-native int... 791
This repo contains Python code that uses LangChain and more to build an AWS b... 759
LangEvals Ragas evaluator 750
RAG Evaluation Framework using Ragas metrics and MLflow tracking 698
Supercharge Your LLM Application Evaluations 🚀 608
Helper modules for AI Engineering RAG Bootcamp reference implementations 577
API Testing Framework to automate and simplify API testing using LLM Agents and ... 570
🎧 DJ's Production RAG Pipeline - PDFs → Pinecone → LLM → RAGAS (Sub-10s E2E) 509
RAG Benchmarking — Framework-agnostic RAG/agentic-AI evaluation harness. Faithfu... 503
Med-Discover is an AI-powered tool designed to assist biomedical researchers by ... 464
A framework for evaluating, monitoring, and benchmarking multi-agent systems 459
A simple and efficient RAG (Retrieval-Augmented Generation) library with Knowled... 448
An application that performs qualitative thematic analysis using LLMs 387
Template for AI chatbots & document management using Retrieval-Augmented Generat... 379
Project GenEval: A Unified Evaluation Framework for Generative AI Applications 376
RAGAS-based evaluation pipeline with trust-specific metrics for TrustRAG 356
356
🌱 Pilot of on-premise RAG system 303
Lightweight RAG evaluation framework for Korean language 267
RAGAS integration adapter for Metrics Computation Engine 192
my nlpswift 185
Uploads results from ragas to Tonic Validate. 177
parrot evaluation 153
A fast, modular reimplementation of RAGAS's FactualCorrectness metric, supportin... 146
A microservice using FastAPI, PostgreSQL, OpenSearch, and LangChain. 132
synthetic data tooling for LLM training and evaluation 132
The Production-Ready Open Source RAG for Education 123
A Python library dedicated to PDF parsing, extracted from the DeepDoc module of the RagFlow project. It provides powerful PDF document parsing capabilities, supporting OC... 114
114
Pipeline-agnostic evaluation and observability for knowledge graph, RAG, and KOS... 105