NeurIPS2021

Statistical Query Lower Bounds for List-Decodable Linear Regression

Ilias Diakonikolas, Daniel Kane, Ankit Pensia, Thanasis Pittas, Alistair Stewart

26 citations

Abstract

We study the problem of list-decodable linear regression, where an adversary can corrupt a majority of the examples. Specifically, we are given a set TT of labeled examples (x,y)Rd×R(x, y) \in \mathbb{R}^d \times \mathbb{R} and a parameter 0<α<1/20<\alpha<1/2 such that an α\alpha-fraction of the points in TT are i.i.d. samples from a linear regression model with Gaussian covariates, and the remaining (1α)(1-\alpha)-fraction of the points are drawn from an arbitrary noise distribution. The goal is to output a small list of hypothesis vectors such that at least one of them is close to the target regression vector. Our main result is a Statistical Query (SQ) lower bound of dpoly(1/α)d^{\mathrm{poly}(1/\alpha)} for this problem. Our SQ lower bound qualitatively matches the performance of previously developed algorithms, providing evidence that current upper bounds for this task are nearly best possible.