LG DS MLNov 18, 2024

Reliable Learning of Halfspaces under Gaussian Marginals

Ilias Diakonikolas, Lisheng Ren, Nikos Zarifis

arXiv:2411.11238v12.6h-index: 14NIPS

Originality Incremental advance

AI Analysis

This work addresses reliable learning scenarios for machine learning practitioners dealing with cost-sensitive errors in halfspace classification under Gaussian distributions, showing a computational separation from standard agnostic learning.

The paper tackles the problem of PAC learning halfspaces in the reliable agnostic model, where one type of error is more costly, by proposing a new algorithm for Gaussian halfspaces with sample and computational complexity expressed as d^{O(log(min{1/α, 1/ε}))}min(2^{log(1/ε)^{O(log(1/α))}},2^{poly(1/ε)}), and provides a Statistical Query lower bound suggesting the d^{Ω(log(1/α))} dependence is optimal.

We study the problem of PAC learning halfspaces in the reliable agnostic model of Kalai et al. (2012). The reliable PAC model captures learning scenarios where one type of error is costlier than the others. Our main positive result is a new algorithm for reliable learning of Gaussian halfspaces on $\mathbb{R}^d$ with sample and computational complexity $$d^{O(\log (\min\{1/α, 1/ε\}))}\min (2^{\log(1/ε)^{O(\log (1/α))}},2^{\mathrm{poly}(1/ε)})\;,$$ where $ε$ is the excess error and $α$ is the bias of the optimal halfspace. We complement our upper bound with a Statistical Query lower bound suggesting that the $d^{Ω(\log (1/α))}$ dependence is best possible. Conceptually, our results imply a strong computational separation between reliable agnostic learning and standard agnostic learning of halfspaces in the Gaussian setting.

View on arXiv PDF

Similar