Structured pruning and bias visualization tools for large language models (LLMs), for model optimization and fairness analysis.
Programmable Neural Network Compression