ROAILGJul 25, 2025

GABRIL: Gaze-Based Regularization for Mitigating Causal Confusion in Imitation Learning

arXiv:2507.19647v15 citationsh-index: 22IROS
Originality Incremental advance
AI Analysis

This addresses the problem of poor generalization in imitation learning for robotics and gaming domains, though it is an incremental advance.

The paper tackles causal confusion in imitation learning by introducing GABRIL, a method that uses human gaze data for regularization, resulting in performance improvements of 179% over baselines in Atari and 76% in CARLA.

Imitation Learning (IL) is a widely adopted approach which enables agents to learn from human expert demonstrations by framing the task as a supervised learning problem. However, IL often suffers from causal confusion, where agents misinterpret spurious correlations as causal relationships, leading to poor performance in testing environments with distribution shift. To address this issue, we introduce GAze-Based Regularization in Imitation Learning (GABRIL), a novel method that leverages the human gaze data gathered during the data collection phase to guide the representation learning in IL. GABRIL utilizes a regularization loss which encourages the model to focus on causally relevant features identified through expert gaze and consequently mitigates the effects of confounding variables. We validate our approach in Atari environments and the Bench2Drive benchmark in CARLA by collecting human gaze datasets and applying our method in both domains. Experimental results show that the improvement of GABRIL over behavior cloning is around 179% more than the same number for other baselines in the Atari and 76% in the CARLA setup. Finally, we show that our method provides extra explainability when compared to regular IL agents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes