Quality tutorials on transformers.
The Transformer … “Explained”?
An intuitive explanation of the Transformer, motivated through the lens of CNNs, RNNs, etc.
Illustrated Guide to Transformers: Step by Step Explanation
In this post, we’ll focus on the one paper that started it all, “Attention Is All You Need”.
The Illustrated Transformer
In this post, we will look at The Transformer – a model that uses attention to boost the speed with which these models can be trained.
The Annotated GPT-2
GPT-2 explained with visualization and PyTorch code.
Illustrated: Self-Attention
Step-by-step guide to self-attention with illustrations and code.
The Annotated Transformer
In this post I present an “annotated” version of the paper in the form of a line-by-line implementation.
Visual Paper Summary: ALBERT (A Lite BERT)
An illustrated summary of the ALBERT paper and how it improves on BERT and makes it more resource-efficient.
A Visual Guide to Using BERT for the First Time
A tutorial on using a variant of BERT to classify sentences.
