CVOct 6, 2025

AvatarVTON: 4D Virtual Try-On for Animatable Avatars

arXiv:2510.04822v11 citationsh-index: 4
Originality Highly original
AI Analysis

This addresses the need for dynamic garment interactions in AR/VR, gaming, and digital-human applications without requiring multi-view captures or physics priors.

The paper tackles the problem of 4D virtual try-on for animatable avatars by proposing AvatarVTON, a framework that generates realistic try-on results from a single garment image, enabling free pose control, novel-view rendering, and diverse garment choices, achieving high fidelity, diversity, and dynamic garment realism.

We propose AvatarVTON, the first 4D virtual try-on framework that generates realistic try-on results from a single in-shop garment image, enabling free pose control, novel-view rendering, and diverse garment choices. Unlike existing methods, AvatarVTON supports dynamic garment interactions under single-view supervision, without relying on multi-view garment captures or physics priors. The framework consists of two key modules: (1) a Reciprocal Flow Rectifier, a prior-free optical-flow correction strategy that stabilizes avatar fitting and ensures temporal coherence; and (2) a Non-Linear Deformer, which decomposes Gaussian maps into view-pose-invariant and view-pose-specific components, enabling adaptive, non-linear garment deformations. To establish a benchmark for 4D virtual try-on, we extend existing baselines with unified modules for fair qualitative and quantitative comparisons. Extensive experiments show that AvatarVTON achieves high fidelity, diversity, and dynamic garment realism, making it well-suited for AR/VR, gaming, and digital-human applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes