DSAILGPRAug 18, 2024

Parallel Sampling via Counting

arXiv:2408.09442v19 citationsh-index: 25
Originality Incremental advance
AI Analysis

This addresses a bottleneck in sampling for machine learning models like autoregressive neural networks, offering a theoretical speedup, though it is incremental as it builds on existing oracle-based methods.

The paper tackles the problem of sampling from arbitrary distributions on product spaces using parallelization and counting oracles, achieving a runtime of O(n^{2/3} polylog(n,q)), which is the first sublinear in n for such distributions, and suggests a potential n^{1/3}-factor speedup for autoregressive models.

We show how to use parallelization to speed up sampling from an arbitrary distribution $μ$ on a product space $[q]^n$, given oracle access to counting queries: $\mathbb{P}_{X\sim μ}[X_S=σ_S]$ for any $S\subseteq [n]$ and $σ_S \in [q]^S$. Our algorithm takes $O({n^{2/3}\cdot \operatorname{polylog}(n,q)})$ parallel time, to the best of our knowledge, the first sublinear in $n$ runtime for arbitrary distributions. Our results have implications for sampling in autoregressive models. Our algorithm directly works with an equivalent oracle that answers conditional marginal queries $\mathbb{P}_{X\sim μ}[X_i=σ_i\;\vert\; X_S=σ_S]$, whose role is played by a trained neural network in autoregressive models. This suggests a roughly $n^{1/3}$-factor speedup is possible for sampling in any-order autoregressive models. We complement our positive result by showing a lower bound of $\widetildeΩ(n^{1/3})$ for the runtime of any parallel sampling algorithm making at most $\operatorname{poly}(n)$ queries to the counting oracle, even for $q=2$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes