Fuyu-8B Model Card
Note: Running Fuyu requires the changes from https://github.com/huggingface/transformers/pull/26911, so you may need to install transformers from the main branch.
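One way to check whether an installed transformers build already includes Fuyu support is to look for the Fuyu classes at the package top level. The sketch below is a minimal check, not an official procedure. It assumes the classes are exposed as `FuyuForCausalLM` and `FuyuProcessor`, and the helper names `exposes` and `fuyu_support_available` are hypothetical.

```python
import importlib
import importlib.util
from typing import Iterable


def exposes(package: str, names: Iterable[str]) -> bool:
    """Return True if `package` is importable and exposes every name in `names`."""
    # find_spec returns None when a top-level package is not installed.
    if importlib.util.find_spec(package) is None:
        return False
    try:
        module = importlib.import_module(package)
    except Exception:
        # A broken or partial install counts as "not available".
        return False
    for name in names:
        try:
            getattr(module, name)
        except Exception:
            # Missing attribute, or a lazy import that failed to resolve.
            return False
    return True


def fuyu_support_available() -> bool:
    # Assumption: Fuyu support shows up as these two top-level classes.
    return exposes("transformers", ["FuyuForCausalLM", "FuyuProcessor"])


if __name__ == "__main__":
    if fuyu_support_available():
        print("This transformers build exposes the Fuyu classes.")
    else:
        # Installing from the main branch is typically done with:
        #   pip install git+https://github.com/huggingface/transformers
        print("Fuyu classes not found; consider installing transformers from main.")
```

If the check fails on a released version, installing from the main branch as described in the note above is the likely remedy.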
Model
Fuyu-8B is a multi-modal text and image transformer trained by Adept AI.
Architecturally, Fuyu is a vanilla decoder-only transformer: there is no image encoder.
Image patches are instead linearly ...
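As a rough illustration of feeding image patches to a decoder without a separate image encoder, the sketch below assumes each patch is flattened and mapped to the model's hidden size by a single linear layer. The patch size, the dimensions, the weights, and the function names `extract_patches` and `linear_project` are all hypothetical and are not taken from Fuyu's actual implementation. A grayscale image stored as nested lists keeps the example dependency-free.

```python
from typing import List

Matrix = List[List[float]]


def extract_patches(image: Matrix, patch: int) -> Matrix:
    """Split an H x W image into non-overlapping patch x patch tiles.

    Each tile is flattened row by row; tiles are returned in raster order.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be multiples of the patch size")
    patches = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            flat = [
                image[r][c]
                for r in range(top, top + patch)
                for c in range(left, left + patch)
            ]
            patches.append(flat)
    return patches


def linear_project(patches: Matrix, weight: Matrix, bias: List[float]) -> Matrix:
    """Apply y = W x + b to every flattened patch.

    `weight` has shape (hidden_size, patch_dim); `bias` has length hidden_size.
    """
    return [
        [sum(w * x for w, x in zip(row, p)) + b for row, b in zip(weight, bias)]
        for p in patches
    ]


if __name__ == "__main__":
    # A toy 4 x 4 image with pixel values 0..15 and 2 x 2 patches.
    image = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
    patches = extract_patches(image, patch=2)
    # Hypothetical weights projecting each 4-value patch to a hidden size of 2.
    weight = [[1.0, 1.0, 1.0, 1.0], [1.0, 0.0, 0.0, 0.0]]
    bias = [0.0, 0.5]
    embeddings = linear_project(patches, weight, bias)
    print(len(embeddings), "patch embeddings of size", len(embeddings[0]))
```

In this sketch the resulting vectors would take the place of token embeddings in the decoder's input sequence, which is why no dedicated image encoder is involved.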