CVDec 1, 2025

FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing

Yucheng Liao, Jiajun Liang, Kaiqian Cui, Baoquan Zhao, Haoran Xie, Wei Liu, Qing Li, Xudong Mao

arXiv:2512.01755v110.23 citationsh-index: 16

Originality Highly original

AI Analysis

This addresses the issue of severe quality degradation in multi-turn image editing for users of instruction-based models, representing a novel method for a known bottleneck.

The paper tackles the problem of quality degradation in multi-turn image editing by identifying progressive loss of high-frequency information as the cause, and presents FreqEdit, a training-free framework that enables stable editing across 10+ consecutive iterations with superior performance in identity preservation and instruction following compared to seven state-of-the-art baselines.

Instruction-based image editing through natural language has emerged as a powerful paradigm for intuitive visual manipulation. While recent models achieve impressive results on single edits, they suffer from severe quality degradation under multi-turn editing. Through systematic analysis, we identify progressive loss of high-frequency information as the primary cause of this quality degradation. We present FreqEdit, a training-free framework that enables stable editing across 10+ consecutive iterations. Our approach comprises three synergistic components: (1) high-frequency feature injection from reference velocity fields to preserve fine-grained details, (2) an adaptive injection strategy that spatially modulates injection strength for precise region-specific control, and (3) a path compensation mechanism that periodically recalibrates the editing trajectory to prevent over-constraint. Extensive experiments demonstrate that FreqEdit achieves superior performance in both identity preservation and instruction following compared to seven state-of-the-art baselines.

View on arXiv PDF

Similar