LGMLJun 14, 2025

Scaling Probabilistic Circuits via Monarch Matrices

arXiv:2506.12383v111 citationsh-index: 41ICML
Originality Highly original
AI Analysis

This work addresses scalability issues in tractable probabilistic models for machine learning applications, representing an incremental improvement by combining sparse properties and tensorized operations.

The paper tackles the problem of scaling probabilistic circuits by introducing a sparse and structured parameterization using Monarch matrices, which reduces memory and computation costs and achieves state-of-the-art generative modeling performance on benchmarks like Text8, LM1B, and ImageNet with substantially fewer FLOPs during training.

Probabilistic Circuits (PCs) are tractable representations of probability distributions allowing for exact and efficient computation of likelihoods and marginals. Recent advancements have improved the scalability of PCs either by leveraging their sparse properties or through the use of tensorized operations for better hardware utilization. However, no existing method fully exploits both aspects simultaneously. In this paper, we propose a novel sparse and structured parameterization for the sum blocks in PCs. By replacing dense matrices with sparse Monarch matrices, we significantly reduce the memory and computation costs, enabling unprecedented scaling of PCs. From a theory perspective, our construction arises naturally from circuit multiplication; from a practical perspective, compared to previous efforts on scaling up tractable probabilistic models, our approach not only achieves state-of-the-art generative modeling performance on challenging benchmarks like Text8, LM1B and ImageNet, but also demonstrates superior scaling behavior, achieving the same performance with substantially less compute as measured by the number of floating-point operations (FLOPs) during training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes