LGMLJun 17, 2019

Online Matrix Completion with Side Information

arXiv:1906.07255v314 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of efficient online matrix completion for applications like recommendation systems, but it is incremental as it builds on existing matrix factorization and online learning frameworks.

The paper tackles online binary matrix completion with side information by proposing an algorithm and proving mistake and regret bounds of the form $ ilde{O}(D/γ^2)$, where $γ$ is a margin term and $D$ measures side information quality, showing that with predictive side information, $D$ can be as low as $O(k + \ell)$ for distinct row and column factors.

We give an online algorithm and prove novel mistake and regret bounds for online binary matrix completion with side information. The mistake bounds we prove are of the form $\tilde{O}(D/γ^2)$. The term $1/γ^2$ is analogous to the usual margin term in SVM (perceptron) bounds. More specifically, if we assume that there is some factorization of the underlying $m \times n$ matrix into $P Q^\intercal$ where the rows of $P$ are interpreted as "classifiers" in $\mathcal{R}^d$ and the rows of $Q$ as "instances" in $\mathcal{R}^d$, then $γ$ is the maximum (normalized) margin over all factorizations $P Q^\intercal$ consistent with the observed matrix. The quasi-dimension term $D$ measures the quality of side information. In the presence of vacuous side information, $D= m+n$. However, if the side information is predictive of the underlying factorization of the matrix, then in an ideal case, $D \in O(k + \ell)$ where $k$ is the number of distinct row factors and $\ell$ is the number of distinct column factors. We additionally provide a generalization of our algorithm to the inductive setting. In this setting, we provide an example where the side information is not directly specified in advance. For this example, the quasi-dimension $D$ is now bounded by $O(k^2 + \ell^2)$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes