Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing
This work targets image editing for users who need precise object manipulation and background preservation; it is an incremental improvement over existing methods.
The paper tackles consistent image editing, where existing three-branch methods offer limited position control and imperfect background preservation. It proposes a tuning-free two-branch approach that improves background integrity and allows the object's action and its generation position to be edited simultaneously.
Current methods commonly use a three-branch structure (inversion, reconstruction, and editing) to tackle the consistent image editing task. However, these methods lack control over the generation position of the edited object and have issues with background preservation. To overcome these limitations, we propose a tuning-free method with only two branches: inversion and editing. This approach allows users to simultaneously edit the object's action and control the generation position of the edited object. It also achieves improved background preservation. Specifically, at a specific time step of the inversion process, we transfer the edited object's information to the target area and repair or preserve the background in the other areas. In the editing stage, we use the image features in self-attention as queries against the keys and values of the corresponding time step in the inversion branch to achieve consistent image editing. Qualitative image editing results and quantitative evaluation demonstrate the effectiveness of our method. The code is available at https://github.com/mobiushy/move-act.
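The attention step described in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the released code: the function name `kv_injected_self_attention`, the single-head layout, the projection matrices, and the toy shapes are all assumptions made for illustration. It shows only the general idea of forming queries from editing-branch features while taking keys and values from inversion-branch features at the same time step.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def kv_injected_self_attention(edit_feats, inv_feats, w_q, w_k, w_v):
    """Single-head attention with cross-branch keys and values (hypothetical helper).

    edit_feats: (n_tokens, dim) features from the editing branch.
    inv_feats:  (n_tokens, dim) features stored from the inversion branch
                at the corresponding time step.
    """
    q = edit_feats @ w_q   # queries come from the editing branch
    k = inv_feats @ w_k    # keys come from the inversion branch
    v = inv_feats @ w_v    # values come from the inversion branch
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores, axis=-1) @ v

# Toy data: 16 spatial tokens with 8 channels each (assumed sizes).
rng = np.random.default_rng(0)
n_tokens, dim = 16, 8
edit_feats = rng.normal(size=(n_tokens, dim))
inv_feats = rng.normal(size=(n_tokens, dim))
w_q, w_k, w_v = (rng.normal(size=(dim, dim)) for _ in range(3))

out = kv_injected_self_attention(edit_feats, inv_feats, w_q, w_k, w_v)
print(out.shape)
```

Because every output token is a weighted average of inversion-branch values, the edited image draws its appearance from the source content, which is one plausible reading of why this kind of querying helps consistency. In an actual diffusion model the projections would be the pretrained attention weights, with multiple heads and a batch dimension.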