CVApr 11, 2024

ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation

arXiv:2404.07564v1h-index: 10MM
Originality Incremental advance
AI Analysis

This work addresses the challenge of generating realistic images from layouts for applications in computer vision and graphics, representing an incremental improvement over existing methods.

The paper tackles the problem of layout-to-image generation by introducing ObjBlur, a curriculum learning approach with progressive object-level blurring, which stabilizes training and improves image quality, achieving state-of-the-art results on COCO and Visual Genome datasets.

We present ObjBlur, a novel curriculum learning approach to improve layout-to-image generation models, where the task is to produce realistic images from layouts composed of boxes and labels. Our method is based on progressive object-level blurring, which effectively stabilizes training and enhances the quality of generated images. This curriculum learning strategy systematically applies varying degrees of blurring to individual objects or the background during training, starting from strong blurring to progressively cleaner images. Our findings reveal that this approach yields significant performance improvements, stabilized training, smoother convergence, and reduced variance between multiple runs. Moreover, our technique demonstrates its versatility by being compatible with generative adversarial networks and diffusion models, underlining its applicability across various generative modeling paradigms. With ObjBlur, we reach new state-of-the-art results on the complex COCO and Visual Genome datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes