LGMay 12

IGT-OMD: Implicit Gradient Transport for Decision-Focused Learning under Delayed Feedback

Benjamin Amoh, Geoffrey G. Parker, Wesley Marrero

arXiv:2605.1269310.0

Predicted impact top 76% in LG · last 90 daysOriginality Highly original

AI Analysis

For online decision-focused learning, this work solves a fundamental failure mode (staleness amplification) that previously limited practical deployment under delay.

IGT-OMD addresses delayed feedback in decision-focused learning, proving that gradient staleness causes quadratically growing transport error. The algorithm reduces this to linear dependence on delay, achieving the first sublinear regret bound and reducing decision loss by 17–55% on benchmarks.

Decision-focused learning trains predictive models end-to-end against downstream decision loss, but online settings suffer delayed feedback: outcomes may not arrive for many environment interactions. We identify \emph{staleness amplification}, a failure mode unique to bilevel optimization under delay, in which gradient staleness couples with inner-solver sensitivity to inflate regret beyond single-level delay theory. We prove that any black-box delayed optimizer incurs an irreducible regret cost from inner-solver approximation error, and that gradient staleness contributes a quadratically growing transport error without bilevel-aware correction. Our algorithm, \textbf{IGT-OMD}, applies Implicit Gradient Transport to hypergradients within Online Mirror Descent, re-evaluating stale gradients at the current parameters using stored inner solutions. This method reduces transport error from a quadratic to a linear dependence on delay and achieves the first sublinear regret bound for delayed bilevel optimization with queue-length-adaptive step sizes. Controlled experiments provide a \emph{mechanistic fingerprint}: transport benefit is exactly $0.0\%$ ($p=1.00$) at unit delay and grows monotonically to $9.5\%$ at fifty rounds ($p<0.001$), isolating the correction's effect. On Linear Quadratic Regulator, Warcraft shortest-path, and Sinkhorn optimal transport, IGT-OMD reduces decision loss by $17$--$55\%$ relative to single-level baselines, with phase transitions matching the theory.

View on arXiv PDF

Similar