https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.