Activation steering and trait monitoring for Hugging Face Transformers
Linear probes and activation steering for transformer models