Transformers: Architecture, Training & Usage

Where the Transformer and self-supervised learning first triumphed
