In-memory vector store with FastEmbed integration for Python applications.
Scraping and querying documents for LLMs.
Private chat with a local GPT over documents, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
Rust-based retrieval system with hybrid search (vector + BM25), async ingestion, and a gRPC-first API.