MLLGJun 5, 2020

The Expected Jacobian Outerproduct: Theory and Empirics

arXiv:2006.03550v12 citations
Originality Incremental advance
AI Analysis

This work addresses the limitation of EGOP to regression by extending it to classification, offering potential improvements in metric learning for practitioners.

The authors adapted the expected gradient outerproduct (EGOP) operator from regression to multi-class classification, introducing the expected Jacobian outerproduct (EJOP) and proposing a statistically consistent estimator. They demonstrated that using EJOP as a metric or for initialization improves performance in non-parametric classification tasks.

The expected gradient outerproduct (EGOP) of an unknown regression function is an operator that arises in the theory of multi-index regression, and is known to recover those directions that are most relevant to predicting the output. However, work on the EGOP, including that on its cheap estimators, is restricted to the regression setting. In this work, we adapt this operator to the multi-class setting, which we dub the expected Jacobian outerproduct (EJOP). Moreover, we propose a simple rough estimator of the EJOP and show that somewhat surprisingly, it remains statistically consistent under mild assumptions. Furthermore, we show that the eigenvalues and eigenspaces also remain consistent. Finally, we show that the estimated EJOP can be used as a metric to yield improvements in real-world non-parametric classification tasks: both by its use as a metric, and also as cheap initialization in metric learning tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes