LGDMDSAug 10, 2012

Learning pseudo-Boolean k-DNF and Submodular Functions

arXiv:1208.2294v126 citations
Originality Incremental advance
AI Analysis

This provides a more efficient algorithm for learning submodular functions, which are important in optimization and machine learning, though it builds incrementally on prior work.

The paper tackles the problem of learning submodular functions with integer range by representing them as pseudo-Boolean k-DNF formulas and generalizing a PAC-learning algorithm, resulting in a time complexity polynomial in n and k^{O(k log k / ε)}, improving over previous n^{O(k)} query complexity. It also implies a property tester with query complexity polynomial in n for certain k values.

We prove that any submodular function f: {0,1}^n -> {0,1,...,k} can be represented as a pseudo-Boolean 2k-DNF formula. Pseudo-Boolean DNFs are a natural generalization of DNF representation for functions with integer range. Each term in such a formula has an associated integral constant. We show that an analog of Hastad's switching lemma holds for pseudo-Boolean k-DNFs if all constants associated with the terms of the formula are bounded. This allows us to generalize Mansour's PAC-learning algorithm for k-DNFs to pseudo-Boolean k-DNFs, and hence gives a PAC-learning algorithm with membership queries under the uniform distribution for submodular functions of the form f:{0,1}^n -> {0,1,...,k}. Our algorithm runs in time polynomial in n, k^{O(k \log k / ε)}, 1/εand log(1/δ) and works even in the agnostic setting. The line of previous work on learning submodular functions [Balcan, Harvey (STOC '11), Gupta, Hardt, Roth, Ullman (STOC '11), Cheraghchi, Klivans, Kothari, Lee (SODA '12)] implies only n^{O(k)} query complexity for learning submodular functions in this setting, for fixed epsilon and delta. Our learning algorithm implies a property tester for submodularity of functions f:{0,1}^n -> {0, ..., k} with query complexity polynomial in n for k=O((\log n/ \loglog n)^{1/2}) and constant proximity parameter ε.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes