AIJun 2, 2025

On the Hardness of Approximating Distributions with Tractable Probabilistic Models

arXiv:2506.01281v29.62 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses the fundamental trade-off between expressivity and inference efficiency in probabilistic modeling, providing hardness results that highlight limitations in approximation for researchers in machine learning and theoretical computer science.

The paper tackles the challenge of approximating arbitrary distributions with tractable probabilistic models while maintaining efficient inference, showing that achieving bounded f-divergence approximation is NP-hard for models that can compute marginals tractably and proving an exponential size gap between different classes of probabilistic circuits.

A fundamental challenge in probabilistic modeling is to balance expressivity and inference efficiency. Tractable probabilistic models (TPMs) aim to directly address this tradeoff by imposing constraints that guarantee efficient inference of certain queries while maintaining expressivity. In particular, probabilistic circuits (PCs) provide a unifying framework for many TPMs, by characterizing families of models as circuits satisfying different structural properties. Because the complexity of inference on PCs is a function of the circuit size, understanding the size requirements of different families of PCs is fundamental in mapping the trade-off between tractability and expressive efficiency. However, the study of expressive efficiency of circuits are often concerned with exact representations, which may not align with model learning, where we look to approximate the underlying data distribution closely by some distance measure. Moreover, due to hardness of inference tasks, exactly representing distributions while supporting tractable inference often incurs exponential size blow-ups. In this paper, we consider a natural, yet so far underexplored, question: can we avoid such size blow-up by allowing for some small approximation error? We study approximating distributions with probabilistic circuits with guarantees based on $f$-divergences, and analyze which inference queries remain well-approximated under this framework. We show that approximating an arbitrary distribution with bounded $f$-divergence is $\mathsf{NP}$-hard for any model that can tractably compute marginals. In addition, we prove an exponential size gap for approximation between the class of decomposable PCs and that of decomposable and deterministic PCs.

View on arXiv PDF

Similar