CVFeb 27, 2024

Diffusion Model-Based Image Editing: A Survey

arXiv:2402.17525v4262 citationsh-index: 11Has CodeIEEE Trans Pattern Anal Mach Intell
Originality Synthesis-oriented
AI Analysis

It synthesizes existing research for researchers and practitioners in computer vision, but is incremental as it primarily reviews and categorizes prior work.

This survey provides an exhaustive overview of diffusion model-based image editing methods, analyzing them from multiple perspectives and proposing a systematic benchmark called EditEval with an innovative LMM Score metric for evaluation.

Denoising diffusion models have emerged as a powerful tool for various image generation and editing tasks, facilitating the synthesis of visual content in an unconditional or input-conditional manner. The core idea behind them is learning to reverse the process of gradually adding noise to images, allowing them to generate high-quality samples from a complex distribution. In this survey, we provide an exhaustive overview of existing methods using diffusion models for image editing, covering both theoretical and practical aspects in the field. We delve into a thorough analysis and categorization of these works from multiple perspectives, including learning strategies, user-input conditions, and the array of specific editing tasks that can be accomplished. In addition, we pay special attention to image inpainting and outpainting, and explore both earlier traditional context-driven and current multimodal conditional methods, offering a comprehensive analysis of their methodologies. To further evaluate the performance of text-guided image editing algorithms, we propose a systematic benchmark, EditEval, featuring an innovative metric, LMM Score. Finally, we address current limitations and envision some potential directions for future research. The accompanying repository is released at https://github.com/SiatMMLab/Awesome-Diffusion-Model-Based-Image-Editing-Methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes