PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
embeddings-benchmark
mteb

MTEB: Massive Text Embedding Benchmark

2.7M 3K 608
MinishLab
model2vec

Fast State-of-the-Art Static Embeddings

491K 2K 121
huggingface
setfit

Efficient few-shot learning with Sentence Transformers

214K 3K 258
beir-cellar
beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

39K 2K 243
ddangelov
top2vec

Top2Vec learns jointly embedded topic, document and word vectors.

8K 3K 377
loreMemory
loremem-ai

Persistent AI memory for LLMs and AI agents. Local-first. Learns from every interaction.

3K 0 0
pandora-intelligence
fast-sentence-transformers

This repository contains code to run faster sentence-transformers. Simply, faster, sentence-transformers.

2K 144 10
ronaldgosso
semantic-keywords

TF-IDF counts words. semantic-keywords understands meaning. It uses sentence embeddings (all-MiniLM-L6-v2 by default) and Maximal Marginal Relevance (MMR) to return keywords that are both relevant and diverse — not just the most frequent phrases. Works fully offline after a one-time model download. No API key. No rate limits.

1K 0 0
davidberenstein1957
classy-classification

This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Huggingface.

1K 220 15
factlens
factlens

Geometric LLM hallucination detection. No second LLM. Deterministic. Auditable.

937 0 0
bhavsarpratik
easy-transformers

Utility functions to work with transformers

824 10 3
jimnoneill
obsidian-umbra

Turn any Obsidian vault into a Zettelkasten graph — locally, with a dozen years of notes in minutes. 4-phase pipeline: daily splitter (Qwen3-4B) → semantic backlinks (Potion-32M) → keyword linker → synonym clustering (GTE-large + HDBSCAN). Zero cloud.

660 3 0
fireindark707
schema-matching

A python tool using XGboost and sentence-transformers to perform schema matching task on tables.

634 40 13
MartinoMensio
spacy-sentence-bert

Sentence transformers models for SpaCy

595 108 7
KRR-Oxford
hierarchy-transformers

Language Models as Hierarchy Encoders

482 42 8
rasyosef
splade-index

Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba

460 38 1
cornelcroi
context-lens

Semantic search knowledge base for MCP-enabled AI assistants. Index local files or GitHub repos, query with natural language. Built on LanceDB vector storage. Works with Claude Desktop, Cursor, and other MCP clients.

447 21 2
galenphall
incite-app

Local-first citation recommendation system

445 0 1
emapco
rk-transformers

Export and Run Hugging Face Transformers Models on Rockchip NPUs

411 25 2
limcheekin
open-text-embeddings

Open Source Text Embedding Models with OpenAI Compatible API

399 168 25
LazerLambda
modern-bert-score

Re-implementation of BERTScore for evaluation of generated text, leveraging vLLM and SentenceTransformers.

363 0 0
psarno
pyragix

Local-first Python RAG pipeline with sentence-transformer embeddings, FAISS/BM25 hybrid retrieval, query expansion, reranking, and Ollama-driven generation.

307 1 0
AstraBert
sentrev

Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs

292 30 1
turian
embeddingcache

Retrieve text embeddings, but cache them locally if we have already computed them.

291 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery