CVApr 30, 2022

Composition-aware Graphic Layout GAN for Visual-textual Presentation Designs

arXiv:2205.00303v391 citationsh-index: 29
Originality Incremental advance
AI Analysis

This addresses the graphic layout generation problem for advertising and design applications, but it is incremental as it builds on prior work with a novel domain alignment module.

The paper tackles the problem of generating visual-textual presentation designs by proposing CGL-GAN, a deep generative model that synthesizes layouts based on image compositions, and demonstrates high-quality results through evaluations on a dataset of 60,548 advertising posters.

In this paper, we study the graphic layout generation problem of producing high-quality visual-textual presentation designs for given images. We note that image compositions, which contain not only global semantics but also spatial information, would largely affect layout results. Hence, we propose a deep generative model, dubbed as composition-aware graphic layout GAN (CGL-GAN), to synthesize layouts based on the global and spatial visual contents of input images. To obtain training images from images that already contain manually designed graphic layout data, previous work suggests masking design elements (e.g., texts and embellishments) as model inputs, which inevitably leaves hint of the ground truth. We study the misalignment between the training inputs (with hint masks) and test inputs (without masks), and design a novel domain alignment module (DAM) to narrow this gap. For training, we built a large-scale layout dataset which consists of 60,548 advertising posters with annotated layout information. To evaluate the generated layouts, we propose three novel metrics according to aesthetic intuitions. Through both quantitative and qualitative evaluations, we demonstrate that the proposed model can synthesize high-quality graphic layouts according to image compositions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes