A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/Pallas/JAX).
Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with custom configuration mode