Generalized Inverse Optimization through Online Learning
This work addresses the need for real-time inverse optimization applications, offering a novel online learning approach that is incremental over existing batch methods.
The paper tackles the problem of adapting inverse optimization to online settings, proposing a framework that achieves O(1/√T) convergence and demonstrates high accuracy, robustness to noise, and significant computational improvements over batch methods.
Inverse optimization is a powerful paradigm for learning preferences and restrictions that explain the behavior of a decision maker, based on a set of external signal and the corresponding decision pairs. However, most inverse optimization algorithms are designed specifically in batch setting, where all the data is available in advance. As a consequence, there has been rare use of these methods in an online setting suitable for real-time applications. In this paper, we propose a general framework for inverse optimization through online learning. Specifically, we develop an online learning algorithm that uses an implicit update rule which can handle noisy data. Moreover, under additional regularity assumptions in terms of the data and the model, we prove that our algorithm converges at a rate of $\mathcal{O}(1/\sqrt{T})$ and is statistically consistent. In our experiments, we show the online learning approach can learn the parameters with great accuracy and is very robust to noises, and achieves a dramatic improvement in computational efficacy over the batch learning approach.