Tutorial on Diffusion Models for Imaging and Vision
It serves as an educational resource for undergraduate and graduate students interested in researching or applying diffusion models, but it is incremental as it does not present new research.
This tutorial discusses the essential ideas behind diffusion models, which are generative tools that have enabled applications like text-to-image and text-to-video generation by overcoming shortcomings in previous approaches.
The astonishing growth of generative tools in recent years has empowered many exciting applications in text-to-image generation and text-to-video generation. The underlying principle behind these generative tools is the concept of diffusion, a particular sampling mechanism that has overcome some shortcomings that were deemed difficult in the previous approaches. The goal of this tutorial is to discuss the essential ideas underlying the diffusion models. The target audience of this tutorial includes undergraduate and graduate students who are interested in doing research on diffusion models or applying these models to solve other problems.