Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
This addresses the problem of achieving precise and versatile image generation for applications in AI and creative industries, representing a novel method rather than an incremental improvement.
The paper tackles high-quality image generation by introducing Edify Image, a family of diffusion models using a novel Laplacian diffusion process to generate photorealistic images with pixel-perfect accuracy, supporting applications like text-to-image synthesis and 4K upsampling.
We introduce Edify Image, a family of diffusion models capable of generating photorealistic image content with pixel-perfect accuracy. Edify Image utilizes cascaded pixel-space diffusion models trained using a novel Laplacian diffusion process, in which image signals at different frequency bands are attenuated at varying rates. Edify Image supports a wide range of applications, including text-to-image synthesis, 4K upsampling, ControlNets, 360 HDR panorama generation, and finetuning for image customization.