adept/fuyu-8b · Hugging Face

Fuyu-8B Model Card

Note: Running Fuyu requires https://github.com/huggingface/transformers/pull/26911, which may require running transformers on main!

Model

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI.

Architecturally, Fuyu is a vanilla decoder-only transformer - there is no image encoder.

Image patches are instead linearly... See more