Fast Multi-dimensional Sparse Attention
Neighborhood Attention for Apple Silicon - MLX backend
Neighborhood Attention for Apple Silicon — PyTorch MPS backend