A Keras-based and TensorFlow-backend language model toolkit.
An implementation of the paper: "Gated Slot Attention for Efficient Linear-Time Sequence Modeling" in PyTorch