CVHCFeb 14, 2025

PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control

arXiv:2502.10258v12 citationsh-index: 3ICASSP
Originality Incremental advance
AI Analysis

This is an incremental improvement for users in image editing workflows, offering efficiency and flexibility.

The paper tackles the problem of multi-instruction image editing by enabling users to apply multiple edits with masks in a single pass, eliminating iterative refinement and achieving precise control through a novel attention mechanism.

We present PromptArtisan, a groundbreaking approach to multi-instruction image editing that achieves remarkable results in a single pass, eliminating the need for time-consuming iterative refinement. Our method empowers users to provide multiple editing instructions, each associated with a specific mask within the image. This flexibility allows for complex edits involving mask intersections or overlaps, enabling the realization of intricate and nuanced image transformations. PromptArtisan leverages a pre-trained InstructPix2Pix model in conjunction with a novel Complete Attention Control Mechanism (CACM). This mechanism ensures precise adherence to user instructions, granting fine-grained control over the editing process. Furthermore, our approach is zero-shot, requiring no additional training, and boasts improved processing complexity compared to traditional iterative methods. By seamlessly integrating multi-instruction capabilities, single-pass efficiency, and complete attention control, PromptArtisan unlocks new possibilities for creative and efficient image editing workflows, catering to both novice and expert users alike.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes