Transformers

Attention Is All You Need
Introduces the Transformer, a novel neural network architecture based solely on attention mechanisms for sequence transduction, improving machine translation quality, training speed, and parallelization over recurrent and convolutional models.
proceedings.neurips.ccAttention is all you need paper
Transformers Explained Visually (Part 1): Overview of Functionality | Towards Data Science
towardsdatascience.com
Transformers Explained Visually I
Patchifying Images for Computer Vision
Vision TransFormers original paper

SAM model
Ideas related to this collection