GRCVMMMar 12, 2025

InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images

arXiv:2503.09130v11 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses a challenging task in image editing for creative and practical applications, but it is incremental as it builds on prior work with a novel regularization approach.

The paper tackles the problem of editing human-object interactions in images to transform existing interactions into new ones while preserving subject and object identities, achieving significant outperformance over existing methods as demonstrated on the introduced IEBench benchmark.

This paper presents InteractEdit, a novel framework for zero-shot Human-Object Interaction (HOI) editing, addressing the challenging task of transforming an existing interaction in an image into a new, desired interaction while preserving the identities of the subject and object. Unlike simpler image editing scenarios such as attribute manipulation, object replacement or style transfer, HOI editing involves complex spatial, contextual, and relational dependencies inherent in humans-objects interactions. Existing methods often overfit to the source image structure, limiting their ability to adapt to the substantial structural modifications demanded by new interactions. To address this, InteractEdit decomposes each scene into subject, object, and background components, then employs Low-Rank Adaptation (LoRA) and selective fine-tuning to preserve pretrained interaction priors while learning the visual identity of the source image. This regularization strategy effectively balances interaction edits with identity consistency. We further introduce IEBench, the most comprehensive benchmark for HOI editing, which evaluates both interaction editing and identity preservation. Our extensive experiments show that InteractEdit significantly outperforms existing methods, establishing a strong baseline for future HOI editing research and unlocking new possibilities for creative and practical applications. Code will be released upon publication.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes