ML LG OCNov 30, 2020

Persistent Reductions in Regularized Loss Minimization for Variable Selection

arXiv:2011.14549v11.4

Originality Incremental advance

AI Analysis

This work provides a method for practitioners to reduce the dimensionality of variable selection problems without needing to run iterative optimization, which is useful for large datasets.

This paper introduces a method to identify a subset of features that will have zero coefficients in all optimal solutions for a broad class of regularized loss minimization problems, even before any optimization. This procedure is efficient and applicable to ultra-high dimensional problems.

In the context of regularized loss minimization with polyhedral gauges, we show that for a broad class of loss functions (possibly non-smooth and non-convex) and under a simple geometric condition on the input data it is possible to efficiently identify a subset of features which are guaranteed to have zero coefficients in all optimal solutions in all problems with loss functions from said class, before any iterative optimization has been performed for the original problem. This procedure is standalone, takes only the data as input, and does not require any calls to the loss function. Therefore, we term this procedure as a persistent reduction for the aforementioned class of regularized loss minimization problems. This reduction can be efficiently implemented via an extreme ray identification subroutine applied to a polyhedral cone formed from the datapoints. We employ an existing output-sensitive algorithm for extreme ray identification which makes our guarantee and algorithm applicable in ultra-high dimensional problems.

View on arXiv PDF

Similar