LGCVJul 11, 2025

Confounder-Free Continual Learning via Recursive Feature Normalization

arXiv:2507.09031v2h-index: 38ICML
AI Analysis

This addresses the problem of spurious correlations in continual learning for AI systems, though it is incremental as it builds on existing metadata normalization methods.

The paper tackled the challenge of removing confounder effects in continual learning to prevent biased predictions, introducing the Recursive MDN layer that reduces catastrophic forgetting and promotes equitable predictions across population groups.

Confounders are extraneous variables that affect both the input and the target, resulting in spurious correlations and biased predictions. There are recent advances in dealing with or removing confounders in traditional models, such as metadata normalization (MDN), where the distribution of the learned features is adjusted based on the study confounders. However, in the context of continual learning, where a model learns continuously from new data over time without forgetting, learning feature representations that are invariant to confounders remains a significant challenge. To remove their influence from intermediate feature representations, we introduce the Recursive MDN (R-MDN) layer, which can be integrated into any deep learning architecture, including vision transformers, and at any model stage. R-MDN performs statistical regression via the recursive least squares algorithm to maintain and continually update an internal model state with respect to changing distributions of data and confounding variables. Our experiments demonstrate that R-MDN promotes equitable predictions across population groups, both within static learning and across different stages of continual learning, by reducing catastrophic forgetting caused by confounder effects changing over time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes