CV · AI · Jul 4, 2025

Pose-Star: Anatomy-Aware Editing for Open-World Fashion Images

arXiv:2507.03402v1 · 1 citation
Originality: Incremental advance
AI Analysis

This work addresses limitations in fashion image editing for real-world use, though it appears incremental by building on existing two-stage pipelines.

The paper tackles the problem of poor user-defined flexibility and weak pose robustness in fashion image editing by proposing Pose-Star, a framework that dynamically recomposes body structures into anatomy-aware masks, achieving enhanced localization and noise suppression for open-world applications.

To advance real-world fashion image editing, we analyze existing two-stage pipelines (mask generation followed by diffusion-based editing), which overly prioritize generator optimization while neglecting mask controllability. This results in two critical limitations: (I) poor user-defined flexibility (coarse-grained human masks restrict edits to predefined regions such as the upper torso, while fine-grained clothes masks preserve poses but forbid style and length customization); and (II) weak pose robustness (mask generators fail on articulated poses and miss rare regions such as the waist, while human parsers remain limited by predefined categories). To address these gaps, we propose Pose-Star, a framework that dynamically recomposes body structures (e.g., neck, chest) into anatomy-aware masks (e.g., chest-length) for user-defined edits. In Pose-Star, we calibrate diffusion-derived attention (Star tokens) via skeletal keypoints to enhance rare structure localization in complex poses, suppress noise through phase-aware analysis of attention dynamics (Convergence, Stabilization, Divergence) with threshold masking and sliding-window fusion, and refine edges via cross-self attention merging and Canny alignment. This work bridges controlled benchmarks and open-world demands, pioneering anatomy-aware, pose-robust editing and laying the foundation for industrial fashion image editing.
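The abstract names threshold masking and sliding-window fusion but gives no details, so the following is only a minimal sketch of what such a step could look like, not the paper's implementation. It assumes the attention maps from consecutive denoising steps are available as small 2D grids of floats. The function names, the window size, the threshold value, and the rule of keeping only pixels that are active in every window are all hypothetical choices made for illustration.

```python
from typing import List

AttnMap = List[List[float]]   # one attention map: rows x cols of scores
Mask = List[List[int]]        # binary mask with the same shape


def sliding_window_fuse(maps: List[AttnMap], window: int) -> List[AttnMap]:
    """Average every run of `window` consecutive attention maps."""
    if window < 1 or window > len(maps):
        raise ValueError("window must be between 1 and len(maps)")
    rows, cols = len(maps[0]), len(maps[0][0])
    fused = []
    for start in range(len(maps) - window + 1):
        chunk = maps[start:start + window]
        fused.append([
            [sum(m[r][c] for m in chunk) / window for c in range(cols)]
            for r in range(rows)
        ])
    return fused


def threshold_mask(attn: AttnMap, threshold: float) -> Mask:
    """Binarize one map: 1 where the score reaches the threshold, else 0."""
    return [[1 if v >= threshold else 0 for v in row] for row in attn]


def fuse_and_mask(maps: List[AttnMap], window: int, threshold: float) -> Mask:
    """Hypothetical fusion rule: keep a pixel only if it passes the
    threshold in every windowed average, which drops one-step spikes."""
    masks = [threshold_mask(f, threshold) for f in sliding_window_fuse(maps, window)]
    rows, cols = len(masks[0]), len(masks[0][0])
    return [
        [1 if all(m[r][c] for m in masks) else 0 for c in range(cols)]
        for r in range(rows)
    ]


if __name__ == "__main__":
    # Three toy 2x2 maps standing in for consecutive denoising steps.
    steps = [
        [[0.9, 0.1], [0.8, 0.6]],
        [[0.7, 0.2], [0.4, 0.6]],
        [[0.8, 0.0], [0.9, 0.1]],
    ]
    print(fuse_and_mask(steps, window=2, threshold=0.5))
```

In this toy input the bottom-right pixel is active in the first two steps but fades in the third, so the intersection rule removes it while the consistently attended left column survives. A real pipeline would restrict the windows to the phase it considers reliable and would likely work on tensors rather than nested lists.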

