GitHub - YingqingHe/ScaleCrafter: Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Google presents Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Propose a novel recipe for training non-cascaded large scale pixel-space text-to-image diffusion models
https://t.co/L4ElMddJoe https://t.co/vsMtG4uIhw

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
"We propose Reflect-DiT, a method that enables
Diffusion Transformers to refine their generations using in-context examples of previously generated images alongside
textual feedback describing necessary... See more

We also use this cascading technique to generate 256x256 class-conditional natural images. In a follow-up work, we pushed the limit of cascaded generation on 256x256 ImageNet images, outperforming BigGAN in FID scores: https://t.co/GURWMomK36 https://t.co/5kIS932Q6M

