ME · LG · May 31

Cellwise and Casewise Robust Covariance in High Dimensions

arXiv:2505.19925 · 51.33 citations · h-index: 76
Predicted impact: top 14% in ME · last 90 days
Originality: Incremental advance
AI Analysis

The paper addresses the need for robust covariance estimation in high-dimensional data contaminated by several types of outliers, a problem faced by statisticians and data analysts.

The paper proposes cellRCov, a robust covariance estimator that handles casewise outliers, cellwise outliers, and missing data in high dimensions, demonstrating superior performance in simulations and a real-world anomaly detection application.

The sample covariance matrix is a cornerstone of multivariate statistics, but it is highly sensitive to outliers. These can be casewise outliers, such as cases belonging to a different population, or cellwise outliers, which are deviating cells (entries) of the data matrix. Recently some robust covariance estimators have been developed that can handle both types of outliers, but their computation is only feasible up to at most 20 dimensions. To remedy this we propose the cellRCov method, a robust covariance estimator that simultaneously handles casewise outliers, cellwise outliers, and missing data. It relies on a decomposition of the covariance on principal and orthogonal subspaces, leveraging recent work on robust PCA. It also employs a ridge-type regularization to stabilize the estimated covariance matrix. We establish some theoretical properties of cellRCov, including its casewise and cellwise influence functions as well as consistency and asymptotic normality. A simulation study demonstrates the superior performance of cellRCov in contaminated and missing data scenarios. Furthermore, its practical utility is illustrated in a real-world application to anomaly detection. We also construct and illustrate the cellRCCA method for robust and regularized canonical correlation analysis.
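The decomposition and regularization described in the abstract can be illustrated with a minimal sketch. This is not the cellRCov algorithm: it substitutes classical PCA for the robust PCA the paper builds on, so it offers no protection against outliers and does not handle missing cells. The function name `subspace_covariance`, the subspace dimension `k`, the weight `delta`, and the choice of a scaled identity as the ridge target are all assumptions made for illustration.

```python
import numpy as np

def subspace_covariance(X, k, delta=0.1):
    """Covariance split into a principal-subspace part and an
    orthogonal-complement part, followed by ridge-type shrinkage.

    Hypothetical illustration only: classical PCA stands in for the
    robust PCA used by cellRCov, and the shrinkage target is assumed.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    Xc = X - X.mean(axis=0)                  # center the columns

    # Classical PCA via SVD; rows of Vt are the principal directions.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:k].T                             # p x k loadings
    T = Xc @ V                               # n x k scores
    R = Xc - T @ V.T                         # residuals, orthogonal to V

    # Covariance contributed by the k-dimensional principal subspace.
    S_principal = V @ (T.T @ T / (n - 1)) @ V.T
    # Covariance contributed by the orthogonal complement.
    S_orthogonal = R.T @ R / (n - 1)
    S = S_principal + S_orthogonal

    # Ridge-type shrinkage toward a scaled identity (assumed target),
    # which keeps the estimate positive definite when p > n.
    target = (np.trace(S) / p) * np.eye(p)
    S_reg = (1 - delta) * S + delta * target
    return S, S_reg
```

With classical PCA the two parts add up to the ordinary sample covariance, so the sketch only shows the structure of the decomposition. In the robust setting each part would be estimated from robustly fitted scores and residuals instead.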
