Gradual warmup learning rate scheduler for PyTorch
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
PyTorch extension for alternative backward rules and gradient transforms (STE, gradient jamming, non-standard activations).
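
For readers unfamiliar with the first entry, here is a minimal sketch of what a gradual (linear) warmup schedule could look like. It is written in plain Python and does not reproduce the API of any package listed above; the helper name `warmup_factor` and its parameters `warmup_steps` and `multiplier` are hypothetical, and the linear ramp shape is an assumption.

```python
def warmup_factor(step: int, warmup_steps: int, multiplier: float = 1.0) -> float:
    """Hypothetical helper: factor by which the base learning rate is scaled at `step`."""
    # Once warmup is over (or if it is disabled), use the full multiplier.
    if warmup_steps <= 0 or step >= warmup_steps:
        return multiplier
    # Linear ramp from 0 up to `multiplier` across the warmup period.
    return multiplier * step / warmup_steps


base_lr = 0.1  # assumed base learning rate, for illustration only
schedule = [base_lr * warmup_factor(s, warmup_steps=5) for s in range(8)]
print(schedule)
```

In PyTorch, a function of this shape could plausibly be passed as `lr_lambda` to `torch.optim.lr_scheduler.LambdaLR`, which multiplies each parameter group's initial learning rate by the returned factor.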