Learning-Augmented Algorithms for $k$-median via Online Learning
This addresses the challenge of adapting clustering solutions to dynamic sequences for applications in data analysis, though it is incremental as it builds on existing learning-augmented and online learning frameworks.
The paper tackles the problem of designing learning-augmented algorithms for k-median clustering by introducing a novel online learning model, resulting in an efficient algorithm that approximately matches the average performance of the best fixed solution in hindsight across instances.
The field of learning-augmented algorithms seeks to use ML techniques on past instances of a problem to inform an algorithm designed for a future instance. In this paper, we introduce a novel model for learning-augmented algorithms inspired by online learning. In this model, we are given a sequence of instances of a problem and the goal of the learning-augmented algorithm is to use prior instances to propose a solution to a future instance of the problem. The performance of the algorithm is measured by its average performance across all the instances, where the performance on a single instance is the ratio between the cost of the algorithm's solution and that of an optimal solution for that instance. We apply this framework to the classic $k$-median clustering problem, and give an efficient learning algorithm that can approximately match the average performance of the best fixed $k$-median solution in hindsight across all the instances. We also experimentally evaluate our algorithm and show that its empirical performance is close to optimal, and also that it automatically adapts the solution to a dynamically changing sequence.