LGNIMar 4, 2025

Efficient and Optimal No-Regret Caching under Partial Observation

arXiv:2503.02758v1h-index: 20IEEE Transactions on Networking
Originality Incremental advance
AI Analysis

This addresses the need for efficient caching policies in practical scenarios like cellular base stations where full request history is unavailable, representing an incremental improvement over existing methods.

The paper tackles the caching problem under partial observation of past requests, proposing a randomized caching policy based on Follow-the-Perturbed-Leader that achieves asymptotically optimal regret with constant amortized time complexity.

Online learning algorithms have been successfully used to design caching policies with sublinear regret in the total number of requests, with no statistical assumption about the request sequence. Most existing algorithms involve computationally expensive operations and require knowledge of all past requests. However, this may not be feasible in practical scenarios like caching at a cellular base station. Therefore, we study the caching problem in a more restrictive setting where only a fraction of past requests are observed, and we propose a randomized caching policy with sublinear regret based on the classic online learning algorithm Follow-the-Perturbed-Leader (FPL). Our caching policy is the first to attain the asymptotically optimal regret bound while ensuring asymptotically constant amortized time complexity in the partial observability setting of requests. The experimental evaluation compares the proposed solution against classic caching policies and validates the proposed approach under synthetic and real-world request traces.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes