CVMay 15, 2025

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization

arXiv:2505.10250v28 citationsh-index: 29Has CodeICML
Originality Incremental advance
AI Analysis

This addresses misalignment and robustness issues in probabilistic HMR for in-the-wild images, representing an incremental improvement through preference optimization.

The paper tackles the problem of aligning diffusion-based human mesh recovery (HMR) predictions with 2D image observations by using direct preference optimization, resulting in a method that outperforms current state-of-the-art approaches.

Human mesh recovery (HMR) from a single image is inherently ill-posed due to depth ambiguity and occlusions. Probabilistic methods have tried to solve this by generating numerous plausible 3D human mesh predictions, but they often exhibit misalignment with 2D image observations and weak robustness to in-the-wild images. To address these issues, we propose ADHMR, a framework that Aligns a Diffusion-based HMR model in a preference optimization manner. First, we train a human mesh prediction assessment model, HMR-Scorer, capable of evaluating predictions even for in-the-wild images without 3D annotations. We then use HMR-Scorer to create a preference dataset, where each input image has a pair of winner and loser mesh predictions. This dataset is used to finetune the base model using direct preference optimization. Moreover, HMR-Scorer also helps improve existing HMR models by data cleaning, even with fewer training samples. Extensive experiments show that ADHMR outperforms current state-of-the-art methods. Code is available at: https://github.com/shenwenhao01/ADHMR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes