LG MLJan 29, 2019

Decentralized Online Learning: Take Benefits from Others' Data without Sharing Your Own to Track Global Trend

Yawei Zhao, Chen Yu, Peilin Zhao, Hanlin Tang, Shuang Qiu, Ji Liu

arXiv:1901.10593v48.119 citations

Originality Incremental advance

AI Analysis

This addresses the inefficiency in decentralized online learning for users like data providers who need to track global trends without sharing private data, though it is incremental as it builds on existing regret analysis.

The paper tackles the problem of whether decentralized online learning can actually reduce regret through communication, showing that under a loss function with adversarial and stochastic components, the Decentralized Online Gradient (DOG) algorithm achieves a regret bound of O(n√T G + √nT σ), where G and σ measure adversarial and stochastic components, enabling benefits from data randomness.

Decentralized Online Learning (online learning in decentralized networks) attracts more and more attention, since it is believed that Decentralized Online Learning can help the data providers cooperatively better solve their online problems without sharing their private data to a third party or other providers. Typically, the cooperation is achieved by letting the data providers exchange their models between neighbors, e.g., recommendation model. However, the best regret bound for a decentralized online learning algorithm is $\Ocal{n\sqrt{T}}$, where $n$ is the number of nodes (or users) and $T$ is the number of iterations. This is clearly insignificant since this bound can be achieved \emph{without} any communication in the networks. This reminds us to ask a fundamental question: \emph{Can people really get benefit from the decentralized online learning by exchanging information?} In this paper, we studied when and why the communication can help the decentralized online learning to reduce the regret. Specifically, each loss function is characterized by two components: the adversarial component and the stochastic component. Under this characterization, we show that decentralized online gradient (DOG) enjoys a regret bound $\Ocal{n\sqrt{T}G + \sqrt{nT}σ}$, where $G$ measures the magnitude of the adversarial component in the private data (or equivalently the local loss function) and $σ$ measures the randomness within the private data. This regret suggests that people can get benefits from the randomness in the private data by exchanging private information. Another important contribution of this paper is to consider the dynamic regret -- a more practical regret to track users' interest dynamics. Empirical studies are also conducted to validate our analysis.

View on arXiv PDF

Similar