LGGTMLNov 28, 2019

Understand Dynamic Regret with Switching Cost for Online Decision Making

arXiv:1911.12595v115 citations
Originality Incremental advance
AI Analysis

This work provides theoretical insights for researchers in online optimization, though it is incremental as it builds on existing regret analysis.

The paper investigates the relationship between dynamic regret and switching cost in online decision making, finding that switching cost significantly impacts dynamic regret in Online Algorithms but not in Online Convex Optimization, with a lower bound for OCO matching the no-switching cost case.

As a metric to measure the performance of an online method, dynamic regret with switching cost has drawn much attention for online decision making problems. Although the sublinear regret has been provided in many previous researches, we still have little knowledge about the relation between the dynamic regret and the switching cost. In the paper, we investigate the relation for two classic online settings: Online Algorithms (OA) and Online Convex Optimization (OCO). We provide a new theoretical analysis framework, which shows an interesting observation, that is, the relation between the switching cost and the dynamic regret is different for settings of OA and OCO. Specifically, the switching cost has significant impact on the dynamic regret in the setting of OA. But, it does not have an impact on the dynamic regret in the setting of OCO. Furthermore, we provide a lower bound of regret for the setting of OCO, which is same with the lower bound in the case of no switching cost. It shows that the switching cost does not change the difficulty of online decision making problems in the setting of OCO.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes