DS LG ST MLJun 17, 2021

Statistical Query Lower Bounds for List-Decodable Linear Regression

Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia, Thanasis Pittas, Alistair Stewart

arXiv:2106.09689v113.826 citations

Originality Highly original

AI Analysis

This provides evidence for the near-optimality of current methods in robust regression, addressing a problem in machine learning for handling adversarial noise.

The paper tackles list-decodable linear regression with adversarial corruption of a majority of examples, establishing a Statistical Query lower bound of d^poly(1/α) that matches existing algorithmic upper bounds.

We study the problem of list-decodable linear regression, where an adversary can corrupt a majority of the examples. Specifically, we are given a set $T$ of labeled examples $(x, y) \in \mathbb{R}^d \times \mathbb{R}$ and a parameter $0< α<1/2$ such that an $α$-fraction of the points in $T$ are i.i.d. samples from a linear regression model with Gaussian covariates, and the remaining $(1-α)$-fraction of the points are drawn from an arbitrary noise distribution. The goal is to output a small list of hypothesis vectors such that at least one of them is close to the target regression vector. Our main result is a Statistical Query (SQ) lower bound of $d^{\mathrm{poly}(1/α)}$ for this problem. Our SQ lower bound qualitatively matches the performance of previously developed algorithms, providing evidence that current upper bounds for this task are nearly best possible.

View on arXiv PDF

Similar