LGAIMar 21, 2025

Causally Aligned Curriculum Learning

arXiv:2503.16799v19 citationsh-index: 14ICLR
Originality Incremental advance
AI Analysis

This work addresses a practical problem in RL for applications with unobserved confounders, offering a novel causal approach that is incremental in improving curriculum learning methods.

The paper tackles the challenge of curriculum learning in reinforcement learning when optimal decision rules are not invariant due to unobserved confounders, by deriving a graphical condition for causally aligned source tasks and developing an algorithm that accelerates learning in confounded tasks with pixel observations.

A pervasive challenge in Reinforcement Learning (RL) is the "curse of dimensionality" which is the exponential growth in the state-action space when optimizing a high-dimensional target task. The framework of curriculum learning trains the agent in a curriculum composed of a sequence of related and more manageable source tasks. The expectation is that when some optimal decision rules are shared across source tasks and the target task, the agent could more quickly pick up the necessary skills to behave optimally in the environment, thus accelerating the learning process. However, this critical assumption of invariant optimal decision rules does not necessarily hold in many practical applications, specifically when the underlying environment contains unobserved confounders. This paper studies the problem of curriculum RL through causal lenses. We derive a sufficient graphical condition characterizing causally aligned source tasks, i.e., the invariance of optimal decision rules holds. We further develop an efficient algorithm to generate a causally aligned curriculum, provided with qualitative causal knowledge of the target task. Finally, we validate our proposed methodology through experiments in discrete and continuous confounded tasks with pixel observations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes