CVMar 10

CIGPose: Causal Intervention Graph Neural Network for Whole-Body Pose Estimation

arXiv:2603.09418v166.61 citationsh-index: 4Has Code
Predicted impact top 60% in CV · last 90 daysOriginality Highly original
AI Analysis

This improves pose estimation accuracy for applications like human-computer interaction, though it is incremental as it builds on existing graph neural network methods.

The paper tackles the problem of whole-body pose estimation lacking robustness by addressing spurious correlations from visual context using a causal intervention approach, achieving a new state-of-the-art of 67.0% AP on COCO-WholeBody and 67.5% AP with additional data.

State-of-the-art whole-body pose estimators often lack robustness, producing anatomically implausible predictions in challenging scenes. We posit this failure stems from spurious correlations learned from visual context, a problem we formalize using a Structural Causal Model (SCM). The SCM identifies visual context as a confounder that creates a non-causal backdoor path, corrupting the model's reasoning. We introduce the Causal Intervention Graph Pose (CIGPose) framework to address this by approximating the true causal effect between visual evidence and pose. The core of CIGPose is a novel Causal Intervention Module: it first identifies confounded keypoint representations via predictive uncertainty and then replaces them with learned, context-invariant canonical embeddings. These deconfounded embeddings are processed by a hierarchical graph neural network that reasons over the human skeleton at both local and global semantic levels to enforce anatomical plausibility. Extensive experiments show CIGPose achieves a new state-of-the-art on COCO-WholeBody. Notably, our CIGPose-x model achieves 67.0\% AP, surpassing prior methods that rely on extra training data. With the additional UBody dataset, CIGPose-x is further boosted to 67.5\% AP, demonstrating superior robustness and data efficiency. The codes and models are publicly available at https://github.com/53mins/CIGPose.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes