CVApr 17

Towards Design Compositing

arXiv:2604.1460568.7h-index: 6
Predicted impact top 43% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For graphic design automation, GIST addresses the overlooked problem of stylistic mismatch between input components, enabling plug-and-play harmonization without retraining.

GIST is a training-free, identity-preserving image compositor that harmonizes visually mismatched input elements in graphic design pipelines, improving visual harmony and aesthetic quality over naive pasting as validated by LLaVA-OV and GPT-4V.

Graphic design creation involves harmoniously assembling multimodal components such as images, text, logos, and other visual assets collected from diverse sources, into a visually-appealing and cohesive design. Recent methods have largely focused on layout prediction or complementary element generation, while retaining input elements exactly, implicitly assuming that provided components are already stylistically harmonious. In practice, inputs often come from disparate sources and exhibit visual mismatch, making this assumption limiting. We argue that identity-preserving stylization and compositing of input elements is a critical missing ingredient for truly harmonized components-to-design pipelines. To this end, we propose GIST, a training-free, identity-preserving image compositor that sits between layout prediction and typography generation, and can be plugged into any existing components-to-design or design-refining pipeline without modification. We demonstrate this by integrating GIST with two substantially different existing methods, LaDeCo and Design-o-meter. GIST shows significant improvements in visual harmony and aesthetic quality across both pipelines, as validated by LLaVA-OV and GPT-4V on aspect-wise ratings and pairwise preference over naive pasting. Project Page: abhinav-mahajan10.github.io/GIST/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes