DS CC LGJul 2, 2019

Learning from satisfying assignments under continuous distributions

Clément L. Canonne, Anindya De, Rocco A. Servedio

arXiv:1907.01619v17.37 citations

Originality Highly original

AI Analysis

This work addresses a theoretical machine learning problem for distribution learning, extending prior discrete frameworks to continuous settings with implications for understanding learnability under different background distributions.

The paper tackles the problem of learning probability distributions defined by satisfying assignments of Boolean-valued functions over continuous domains, showing efficient algorithms for degree-1 PTFs with log-concave distributions and degree-2 PTFs with normal distributions, but proving hardness for higher degrees based on cryptographic assumptions.

What kinds of functions are learnable from their satisfying assignments? Motivated by this simple question, we extend the framework of De, Diakonikolas, and Servedio [DDS15], which studied the learnability of probability distributions over $\{0,1\}^n$ defined by the set of satisfying assignments to "low-complexity" Boolean functions, to Boolean-valued functions defined over continuous domains. In our learning scenario there is a known "background distribution" $\mathcal{D}$ over $\mathbb{R}^n$ (such as a known normal distribution or a known log-concave distribution) and the learner is given i.i.d. samples drawn from a target distribution $\mathcal{D}_f$, where $\mathcal{D}_f$ is $\mathcal{D}$ restricted to the satisfying assignments of an unknown low-complexity Boolean-valued function $f$. The problem is to learn an approximation $\mathcal{D}'$ of the target distribution $\mathcal{D}_f$ which has small error as measured in total variation distance. We give a range of efficient algorithms and hardness results for this problem, focusing on the case when $f$ is a low-degree polynomial threshold function (PTF). When the background distribution $\mathcal{D}$ is log-concave, we show that this learning problem is efficiently solvable for degree-1 PTFs (i.e.,~linear threshold functions) but not for degree-2 PTFs. In contrast, when $\mathcal{D}$ is a normal distribution, we show that this learning problem is efficiently solvable for degree-2 PTFs but not for degree-4 PTFs. Our hardness results rely on standard assumptions about secure signature schemes.

View on arXiv PDF

Similar