Attention
Vision TransFormers original paper
Attention Is All You Need
Introduces the Transformer, a novel neural network architecture based solely on attention mechanisms for sequence transduction, improving machine translation quality, training speed, and parallelization over recurrent and convolutional models.
proceedings.neurips.ccAttention is all you need paper
Patchifying Images for Computer Vision

Transformers Explained Visually (Part 1): Overview of Functionality | Towards Data Science
towardsdatascience.com
Transformers Explained Visually I
SAM model

Ideas related to this collection