DS LG MLFeb 10, 2025

On the query complexity of sampling from non-log-concave distributions

arXiv:2502.06200v37.36 citationsh-index: 3COLT

Originality Highly original

AI Analysis

This work addresses the fundamental challenge of sampling from complex distributions in high-dimensional spaces, which is crucial for applications in machine learning and statistics, and highlights a sharp contrast with prior results by showing exponential query complexity under weaker conditions.

The paper tackles the problem of sampling from non-log-concave distributions by establishing a tight query complexity bound, showing that any algorithm requires at least (LM/(dε))^{Ω(d)} queries to achieve ε-accuracy in total variation distance, and provides a matching upper bound with (LM/(dε))^{O(d)} queries.

We study the problem of sampling from a $d$-dimensional distribution with density $p(x)\propto e^{-f(x)}$, which does not necessarily satisfy good isoperimetric conditions. Specifically, we show that for any $L,M$ satisfying $LM\ge d\ge 5$, $ε\in \left(0,\frac{1}{32}\right)$, and any algorithm with query accesses to the value of $f(x)$ and $\nabla f(x)$, there exists an $L$-log-smooth distribution with second moment at most $M$ such that the algorithm requires $\left(\frac{LM}{dε}\right)^{Ω(d)}$ queries to compute a sample whose distribution is within $ε$ in total variation distance to the target distribution. We complement the lower bound with an algorithm requiring $\left(\frac{LM}{dε}\right)^{\mathcal{O}(d)}$ queries, thereby characterizing the tight (up to the constant in the exponent) query complexity for sampling from the family of non-log-concave distributions. Our results are in sharp contrast with the recent work of Huang et al. (COLT'24), where an algorithm with quasi-polynomial query complexity was proposed for sampling from a non-log-concave distribution when $M=\mathtt{poly}(d)$. Their algorithm works under the stronger condition that all distributions along the trajectory of the Ornstein-Uhlenbeck process, starting from the target distribution, are $\mathcal{O}(1)$-log-smooth. We investigate this condition and prove that it is strictly stronger than requiring the target distribution to be $\mathcal O(1)$-log-smooth. Additionally, we study this condition in the context of mixtures of Gaussians. Finally, we place our results within the broader theme of ``sampling versus optimization'', as studied in Ma et al. (PNAS'19). We show that for a wide range of parameters, sampling is strictly easier than optimization by a super-exponential factor in the dimension $d$.

View on arXiv PDF

Similar