LGAIMLJun 16, 2025

No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!

arXiv:2506.13244v36 citationsh-index: 17
Originality Incremental advance
AI Analysis

This addresses resource allocation and learning problems in adversarial environments, offering a novel approach to achieve no-regret guarantees, though it is incremental in extending existing frameworks.

The paper tackles online decision-making under adversarial resource constraints by introducing a spending plan framework, achieving sublinear regret in settings where it was previously impossible, with performance improvements when the plan is well-balanced.

We study online decision making problems under resource constraints, where both reward and cost functions are drawn from distributions that may change adversarially over time. We focus on two canonical settings: $(i)$ online resource allocation where rewards and costs are observed before action selection, and $(ii)$ online learning with resource constraints where they are observed after action selection, under full feedback or bandit feedback. It is well known that achieving sublinear regret in these settings is impossible when reward and cost distributions may change arbitrarily over time. To address this challenge, we analyze a framework in which the learner is guided by a spending plan--a sequence prescribing expected resource usage across rounds. We design general (primal-)dual methods that achieve sublinear regret with respect to baselines that follow the spending plan. Crucially, the performance of our algorithms improves when the spending plan ensures a well-balanced distribution of the budget across rounds. We additionally provide a robust variant of our methods to handle worst-case scenarios where the spending plan is highly imbalanced. To conclude, we study the regret of our algorithms when competing against benchmarks that deviate from the prescribed spending plan.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes