Attention
Patchifying Images for Computer Vision
Attention Is All You Need
Introduces the Transformer, a novel neural network architecture based solely on attention mechanisms for sequence transduction, improving machine translation quality, training speed, and parallelization over recurrent and convolutional models.
proceedings.neurips.ccAttention is all you need paper

Transformers Explained Visually (Part 1): Overview of Functionality | Towards Data Science
towardsdatascience.com
Transformers Explained Visually I
SAM model
Vision TransFormers original paper

Ideas related to this collection