STITLGMLNov 21, 2022

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models

arXiv:2211.11368v57 citationsh-index: 25
AI Analysis

This work addresses the challenge of improving sample efficiency and design in spectral methods for mixed generalized linear models, which is incremental as it builds on existing methods with theoretical enhancements.

The paper tackles the problem of learning multiple signals from unlabeled observations in mixed generalized linear models by developing exact asymptotics for spectral methods in a proportional regime, enabling optimized design that reduces estimation error, as demonstrated through numerical simulations for mixed linear regression and phase retrieval.

In a mixed generalized linear model, the goal is to learn multiple signals from unlabeled observations: each sample comes from exactly one signal, but it is not known which one. We consider the prototypical problem of estimating two statistically independent signals in a mixed generalized linear model with Gaussian covariates. Spectral methods are a popular class of estimators which output the top two eigenvectors of a suitable data-dependent matrix. However, despite the wide applicability, their design is still obtained via heuristic considerations, and the number of samples $n$ needed to guarantee recovery is super-linear in the signal dimension $d$. In this paper, we develop exact asymptotics on spectral methods in the challenging proportional regime in which $n, d$ grow large and their ratio converges to a finite constant. This allows us optimize the design of the spectral method, and combine it with a simple linear estimator, to minimize the estimation error. Our characterization exploits a mix of tools from random matrices, free probability and the theory of approximate message passing algorithms. Numerical simulations for mixed linear regression and phase retrieval demonstrate the advantage enabled by our analysis over existing designs of spectral methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes