DS LG Nov 14, 2025

Learning and Testing Convex Functions

Renato Ferreira Pinto, Cassandra Marcussen, Elchanan Mossel, Shivam Nadimpalli

arXiv:2511.11498v13.32 citations h-index: 60

Originality: Incremental advance

AI Analysis

This work addresses a gap in the study of learning and testing convex functions in high-dimensional continuous settings, though it is incremental relative to prior studies of discrete or restricted settings.

The paper studies learning and testing real-valued convex functions under the standard Gaussian measure, assuming the functions are Lipschitz. It gives an agnostic proper learning algorithm that achieves error ε using n^O(1/ε²) samples, together with a complementary lower bound of n^poly(1/ε) samples in the correlational statistical query (CSQ) model.

We consider the problems of \emph{learning} and \emph{testing} real-valued convex functions over Gaussian space. Despite the extensive study of function convexity across mathematics, statistics, and computer science, its learnability and testability have largely been examined only in discrete or restricted settings -- typically with respect to the Hamming distance, which is ill-suited for real-valued functions. In contrast, we study these problems in high dimensions under the standard Gaussian measure, assuming sample access to the function and a mild smoothness condition, namely Lipschitzness. A smoothness assumption is natural and, in fact, necessary even in one dimension: without it, convexity cannot be inferred from finitely many samples. As our main results, we give:

- Learning Convex Functions: An agnostic proper learning algorithm for Lipschitz convex functions that achieves error $\varepsilon$ using $n^{O(1/\varepsilon^2)}$ samples, together with a complementary lower bound of $n^{\mathrm{poly}(1/\varepsilon)}$ samples in the \emph{correlational statistical query (CSQ)} model.
- Testing Convex Functions: A tolerant (two-sided) tester for convexity of Lipschitz functions with the same sample complexity (as a corollary of our learning result), and a one-sided tester (which never rejects convex functions) using $O(\sqrt{n}/\varepsilon)^n$ samples.
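The one-sided guarantee mentioned in the abstract (a tester that never rejects a convex function) can be illustrated in one dimension. The sketch below is not the paper's algorithm and makes no claim about its sample complexity. It assumes only sample access under the standard Gaussian measure and checks whether the observed points could lie on some convex function, which on the line means the slopes between consecutive sorted points never decrease. The names `consistent_with_convex` and `one_sided_test`, and the tolerance `tol`, are hypothetical choices made for this illustration.

```python
import random


def consistent_with_convex(xs, ys, tol=1e-7):
    """Return True if the points (xs[i], ys[i]) can lie on a convex function.

    In one dimension this holds exactly when the slopes between consecutive
    points, sorted by x, are non-decreasing. `tol` absorbs rounding error.
    """
    pts = sorted(zip(xs, ys))
    slopes = []
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if x1 == x0:
            # A function cannot take two different values at the same x.
            if abs(y1 - y0) > tol:
                return False
            continue
        slopes.append((y1 - y0) / (x1 - x0))
    return all(s1 >= s0 - tol for s0, s1 in zip(slopes, slopes[1:]))


def one_sided_test(f, num_samples, seed=0):
    """Illustrative one-sided check (assumption: 1D, standard Gaussian samples).

    Accepts whenever the samples are consistent with convexity, so a truly
    convex f is never rejected, up to floating-point tolerance.
    """
    rng = random.Random(seed)
    xs = [rng.gauss(0.0, 1.0) for _ in range(num_samples)]
    ys = [f(x) for x in xs]
    return consistent_with_convex(xs, ys)
```

A call such as `one_sided_test(abs, 50)` accepts because `abs` is convex, while a concave function such as `lambda x: -abs(x)` is rejected once the samples fall on both sides of zero. This shows only the "never rejects convex functions" property; how many samples are needed to reject functions that are far from convex in high dimensions is the subject of the paper's bounds.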
