Fast weights transformer


This is an annotated PyTorch implementation of the paper Linear Transformers Are Secretly Fast Weight Memory Systems.

The project includes the annotated implementation, along with the training code and a notebook for training a fast weights transformer on the Tiny Shakespeare dataset.
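As a rough illustration of the idea in the paper's title, the sketch below treats the attention state as a fast weight matrix that is written to with keys and values and read from with queries. It compares a purely additive write (as in linear attention) with a delta-rule write that first retrieves the value currently stored under the key and then corrects it. This is not the code from the annotated implementation: the function names `read`, `additive_update`, and `delta_update` are hypothetical, the sketch uses plain Python lists instead of PyTorch tensors so it runs without dependencies, keys are assumed to be already feature-mapped and of unit norm, and the write strength `beta` is a fixed constant here rather than a learned, input-dependent quantity.

```python
def read(W, q):
    """Retrieve a value from the fast weight memory: y = W q."""
    return [sum(w * x for w, x in zip(row, q)) for row in W]


def additive_update(W, k, v):
    """Additive write, as in linear attention: W <- W + v k^T."""
    return [[w + vi * kj for w, kj in zip(row, k)] for row, vi in zip(W, v)]


def delta_update(W, k, v, beta):
    """Delta-rule write: W <- W + beta * (v - W k) k^T.

    `beta` is a fixed scalar here (an assumption made for this sketch).
    """
    old = read(W, k)  # value currently associated with key k
    return [
        [w + beta * (vi - oi) * kj for w, kj in zip(row, k)]
        for row, vi, oi in zip(W, v, old)
    ]


# Write two different values under the same (unit-norm) key.
k = [1.0, 0.0]
W_add = [[0.0, 0.0], [0.0, 0.0]]
W_delta = [[0.0, 0.0], [0.0, 0.0]]
for v in ([2.0, 3.0], [5.0, 7.0]):
    W_add = additive_update(W_add, k, v)
    W_delta = delta_update(W_delta, k, v, beta=1.0)

print(read(W_add, k))    # [7.0, 10.0]: the two values are summed
print(read(W_delta, k))  # [5.0, 7.0]: the second value replaced the first
```

With the additive write, a repeated key accumulates values in the memory. With the delta-rule write and `beta=1.0`, the old association is overwritten, and smaller `beta` values interpolate between the old and new value.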

labml.ai