LGITMLNov 20, 2019

On Universal Features for High-Dimensional Learning and Inference

arXiv:1911.09105v162 citations
Originality Synthesis-oriented
AI Analysis

This work provides a theoretical foundation for understanding and optimizing learning systems, but it is incremental as it builds on existing concepts like SVD and canonical correlation analysis.

The paper tackles the problem of identifying universal low-dimensional features from high-dimensional data for inference tasks in learning settings, introducing natural notions of universality and showing their local equivalence through an information geometry framework.

We consider the problem of identifying universal low-dimensional features from high-dimensional data for inference tasks in settings involving learning. For such problems, we introduce natural notions of universality and we show a local equivalence among them. Our analysis is naturally expressed via information geometry, and represents a conceptually and computationally useful analysis. The development reveals the complementary roles of the singular value decomposition, Hirschfeld-Gebelein-Rényi maximal correlation, the canonical correlation and principle component analyses of Hotelling and Pearson, Tishby's information bottleneck, Wyner's common information, Ky Fan $k$-norms, and Brieman and Friedman's alternating conditional expectations algorithm. We further illustrate how this framework facilitates understanding and optimizing aspects of learning systems, including multinomial logistic (softmax) regression and the associated neural network architecture, matrix factorization methods for collaborative filtering and other applications, rank-constrained multivariate linear regression, and forms of semi-supervised learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes