- Billions of images are scraped from the internet. These images, along with their text descriptions, are saved in a database. - The AI model uses this database to train through reverse diffusion. - Diffusion adds noise to an image (from the dog to random pixels). - Reverse diffusion turns noise back into an image.