Sentence Transformers Python Packages

mteb

MTEB: Massive Text Embedding Benchmark

2.7M 3K 608

model2vec

Fast State-of-the-Art Static Embeddings

491K 2K 121

setfit

Efficient few-shot learning with Sentence Transformers

214K 3K 258

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use: evaluate your models across 15+ diverse IR datasets.

39K 2K 243

top2vec

Top2Vec learns jointly embedded topic, document and word vectors.

8K 3K 377

loremem-ai

Persistent AI memory for LLMs and AI agents. Local-first. Learns from every interaction.

3K 0 0

fast-sentence-transformers

This repository contains code to run sentence-transformers faster. Simply faster sentence-transformers.

2K 144 10

semantic-keywords

TF-IDF counts words. semantic-keywords understands meaning. It uses sentence embeddings (all-MiniLM-L6-v2 by default) and Maximal Marginal Relevance (MMR) to return keywords that are both relevant and diverse — not just the most frequent phrases. Works fully offline after a one-time model download. No API key. No rate limits.

1K 0 0

classy-classification

This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Hugging Face.

1K 220 15

factlens

Geometric LLM hallucination detection. No second LLM. Deterministic. Auditable.

937 0 0

easy-transformers

Utility functions to work with transformers

824 10 3

obsidian-umbra

Turn any Obsidian vault into a Zettelkasten graph — locally, with a dozen years of notes in minutes. 4-phase pipeline: daily splitter (Qwen3-4B) → semantic backlinks (Potion-32M) → keyword linker → synonym clustering (GTE-large + HDBSCAN). Zero cloud.

660 3 0

schema-matching

A Python tool using XGBoost and sentence-transformers to perform schema matching on tables.

634 40 13

spacy-sentence-bert

Sentence Transformers models for spaCy

595 108 7

hierarchy-transformers

Language Models as Hierarchy Encoders

482 42 8

splade-index

Fast search index for SPLADE sparse retrieval models, implemented in Python using NumPy and Numba

460 38 1

context-lens

Semantic search knowledge base for MCP-enabled AI assistants. Index local files or GitHub repos, query with natural language. Built on LanceDB vector storage. Works with Claude Desktop, Cursor, and other MCP clients.

447 21 2

incite-app

Local-first citation recommendation system

445 0 1

rk-transformers

Export and Run Hugging Face Transformers Models on Rockchip NPUs

411 25 2

open-text-embeddings

Open Source Text Embedding Models with OpenAI Compatible API

399 168 25

modern-bert-score

Re-implementation of BERTScore for evaluation of generated text, leveraging vLLM and SentenceTransformers.

363 0 0

pyragix

Local-first Python RAG pipeline with sentence-transformer embeddings, FAISS/BM25 hybrid retrieval, query expansion, reranking, and Ollama-driven generation.

307 1 0

sentrev

Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs

292 30 1

embeddingcache

Retrieve text embeddings, but cache them locally if we have already computed them.

291 1 0
