LGAICRApr 16, 2021

Lower Bounds on Cross-Entropy Loss in the Presence of Test-time Adversaries

arXiv:2104.08382v24 citations
AI Analysis

This work addresses the theoretical and practical challenge of understanding when robust learning is possible, though it is incremental as it builds on existing bounds and methods.

The paper tackles the problem of establishing fundamental limits for robust supervised learning by deriving optimal lower bounds on cross-entropy loss under test-time adversarial attacks, with results applied to practical datasets to reveal gaps in current robust training methods.

Understanding the fundamental limits of robust supervised learning has emerged as a problem of immense interest, from both practical and theoretical standpoints. In particular, it is critical to determine classifier-agnostic bounds on the training loss to establish when learning is possible. In this paper, we determine optimal lower bounds on the cross-entropy loss in the presence of test-time adversaries, along with the corresponding optimal classification outputs. Our formulation of the bound as a solution to an optimization problem is general enough to encompass any loss function depending on soft classifier outputs. We also propose and provide a proof of correctness for a bespoke algorithm to compute this lower bound efficiently, allowing us to determine lower bounds for multiple practical datasets of interest. We use our lower bounds as a diagnostic tool to determine the effectiveness of current robust training methods and find a gap from optimality at larger budgets. Finally, we investigate the possibility of using of optimal classification outputs as soft labels to empirically improve robust training.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes