LG AI | Feb 23, 2025

Adaptive Conformal Guidance for Learning under Uncertainty

Rui Liu, Peng Gao, Yu Shen, Ming Lin, Pratap Tokekar

arXiv:2502.16736v4 | h-index: 30

Originality: Incremental advance

AI Analysis

The paper addresses degraded learning performance caused by imperfect guidance, a problem that affects practitioners in domains such as autonomous driving and image classification. It is an incremental improvement over existing guidance methods.

The paper tackles noisy or misaligned guidance signals in machine learning by proposing Adaptive Conformal Guidance (AdaConG), which dynamically adjusts how much the learner relies on guidance according to the guidance's uncertainty. Reported gains include over 6x higher rewards than the best-performing baseline in gridworld navigation.

Learning with guidance has proven effective across a wide range of machine learning systems. Guidance may, for example, come from annotated datasets in supervised learning, pseudo-labels in semi-supervised learning, and expert demonstration policies in reinforcement learning. However, guidance signals can be noisy due to domain shifts and limited data availability and may not generalize well. Blindly trusting such signals when they are noisy, incomplete, or misaligned with the target domain can lead to degraded performance. To address these challenges, we propose Adaptive Conformal Guidance (AdaConG), a simple yet effective approach that dynamically modulates the influence of guidance signals based on their associated uncertainty, quantified via split conformal prediction (CP). By adaptively adjusting to guidance uncertainty, AdaConG enables models to reduce reliance on potentially misleading signals and enhance learning performance. We validate AdaConG across diverse tasks, including knowledge distillation, semi-supervised image classification, gridworld navigation, and autonomous driving. Experimental results demonstrate that AdaConG improves performance and robustness under imperfect guidance, e.g., in gridworld navigation, it accelerates convergence and achieves over $6\times$ higher rewards than the best-performing baseline. These results highlight AdaConG as a broadly applicable solution for learning under uncertainty.
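
The abstract names split conformal prediction as the uncertainty measure but does not give the score function or the rule that turns uncertainty into a guidance weight. The sketch below is therefore an illustration under stated assumptions, not the authors' implementation: it assumes a classification setting, uses the common nonconformity score of one minus the probability of the true class, and maps prediction-set size to a weight with a linear rule. The function names `conformal_threshold` and `guidance_weights` and the linear mapping are hypothetical.

```python
import math
import numpy as np

def conformal_threshold(cal_probs, cal_labels, alpha=0.1):
    """Split conformal calibration on held-out guidance predictions.

    Assumed score: 1 - probability the guidance assigns to the true class.
    Returns the k-th smallest score, k = ceil((n + 1) * (1 - alpha)).
    """
    cal_probs = np.asarray(cal_probs, dtype=float)
    cal_labels = np.asarray(cal_labels, dtype=int)
    n = len(cal_labels)
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: every class is admitted.
        return float("inf")
    return float(np.sort(scores)[k - 1])

def guidance_weights(guide_probs, q_hat):
    """Hypothetical mapping from prediction-set size to a weight in [0, 1].

    A singleton set gets weight 1, a set with every class gets weight 0,
    and an empty set is treated as maximally uncertain (weight 0).
    """
    guide_probs = np.asarray(guide_probs, dtype=float)
    num_classes = guide_probs.shape[1]
    # A class enters the prediction set if its score is within the threshold.
    set_sizes = np.sum(1.0 - guide_probs <= q_hat, axis=1)
    linear = 1.0 - (set_sizes - 1) / (num_classes - 1)
    weights = np.where(set_sizes == 0, 0.0, linear)
    return np.clip(weights, 0.0, 1.0), set_sizes
```

Under these assumptions, the per-sample weights would scale a guidance loss term (for example a distillation or pseudo-label loss) before it is added to the task loss, so that samples where the guidance yields a large or empty prediction set contribute less.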
