A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
Tools for creating data series that can be charted