Sparse Autoencoder Python Packages

openinterp

openinterp — Python SDK + CLI. FabricationGuard hallucination probe + ProbeBench leaderboard + Atlas search + Trace generation. pip install openinterp

1K 0 0

mlx-lens

Mechanistic interpretability on Apple Silicon: steering vectors, residual capture, and SAE analysis for MLX models

862 1 0

mechreward

Mechanistic interpretability as reward signal for RL training of LLMs

678 5 0

tiny-dashboard

A tool for visualizing and exploring feature activations in neural language models.

662 16 2

sansa

SANSA - sparse EASE for millions of items

579 46 6

Search Packages