CVMay 17, 2025

Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation

arXiv:2505.12009v1h-index: 2
Originality Incremental advance
AI Analysis

This addresses security risks in digital human generation systems, particularly for live-streaming applications, though it is an incremental improvement over existing adversarial attacks.

The paper tackles security vulnerabilities in expressive human pose and shape (EHPS) estimation models by proposing an unnoticeable black-box attack (UBA) that increases pose estimation errors by 17.27%-58.21% on average without requiring internal model knowledge.

Expressive human pose and shape (EHPS) estimation is vital for digital human generation, particularly in live-streaming applications. However, most existing EHPS models focus primarily on minimizing estimation errors, with limited attention on potential security vulnerabilities. Current adversarial attacks on EHPS models often require white-box access (e.g., model details or gradients) or generate visually conspicuous perturbations, limiting their practicality and ability to expose real-world security threats. To address these limitations, we propose a novel Unnoticeable Black-Box Attack (UBA) against EHPS models. UBA leverages the latent-space representations of natural images to generate an optimal adversarial noise pattern and iteratively refine its attack potency along an optimized direction in digital space. Crucially, this process relies solely on querying the model's output, requiring no internal knowledge of the EHPS architecture, while guiding the noise optimization toward greater stealth and effectiveness. Extensive experiments and visual analyses demonstrate the superiority of UBA. Notably, UBA increases the pose estimation errors of EHPS models by 17.27%-58.21% on average, revealing critical vulnerabilities. These findings underscore the urgent need to address and mitigate security risks associated with digital human generation systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes