MESTAPCOMLDec 23, 2017

Optimization and Testing in Linear Non-Gaussian Component Analysis

arXiv:1712.08837v22 citations
Originality Incremental advance
AI Analysis

This work addresses a specific statistical modeling challenge in signal processing and data analysis, offering incremental improvements to existing methods for handling mixed Gaussian and non-Gaussian components.

The paper tackles the problem of estimating non-Gaussian and Gaussian components in linear non-Gaussian component analysis (LNGCA) by introducing an estimator that maximizes discrepancy from Gaussianity for non-Gaussian components and minimizes it for Gaussian ones, along with a statistical test to determine the number of non-Gaussian components; simulation studies show improvements over competing estimators, and real data applications demonstrate practical value.

Independent component analysis (ICA) decomposes multivariate data into mutually independent components (ICs). The ICA model is subject to a constraint that at most one of these components is Gaussian, which is required for model identifiability. Linear non-Gaussian component analysis (LNGCA) generalizes the ICA model to a linear latent factor model with any number of both non-Gaussian components (signals) and Gaussian components (noise), where observations are linear combinations of independent components. Although the individual Gaussian components are not identifiable, the Gaussian subspace is identifiable. We introduce an estimator along with its optimization approach in which non-Gaussian and Gaussian components are estimated simultaneously, maximizing the discrepancy of each non-Gaussian component from Gaussianity while minimizing the discrepancy of each Gaussian component from Gaussianity. When the number of non-Gaussian components is unknown, we develop a statistical test to determine it based on resampling and the discrepancy of estimated components. Through a variety of simulation studies, we demonstrate the improvements of our estimator over competing estimators, and we illustrate the effectiveness of the test to determine the number of non-Gaussian components. Further, we apply our method to real data examples and demonstrate its practical value.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes