STDSLGMLJun 30, 2025

Sampling and Identity-Testing Without Approximate Tensorization of Entropy

arXiv:2506.23456v11 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses challenges in high-dimensional statistics for distributions lacking ATE, such as mixtures, with incremental improvements over prior work on Poincaré inequalities.

The paper tackles the problem of sampling and identity-testing for mixtures of distributions that satisfy approximate tensorization of entropy (ATE), showing that Glauber dynamics mixes fast with optimal sample complexity and providing efficient identity-testers in the coordinate-conditional sampling model.

Certain tasks in high-dimensional statistics become easier when the underlying distribution satisfies a local-to-global property called approximate tensorization of entropy (ATE). For example, the Glauber dynamics Markov chain of an ATE distribution mixes fast and can produce approximate samples in a small amount of time, since such a distribution satisfies a modified log-Sobolev inequality. Moreover, identity-testing for an ATE distribution requires few samples if the tester is given coordinate conditional access to the unknown distribution, as shown by Blanca, Chen, Štefankovič, and Vigoda (COLT 2023). A natural class of distributions that do not satisfy ATE consists of mixtures of (few) distributions that do satisfy ATE. We study the complexity of identity-testing and sampling for these distributions. Our main results are the following: 1. We show fast mixing of Glauber dynamics from a data-based initialization, with optimal sample complexity, for mixtures of distributions satisfying modified log-Sobolev inequalities. This extends work of Huang, Koehler, Lee, Mohanty, Rajaraman, Vuong, and Wu (STOC 2025, COLT 2025) for mixtures of distributions satisfying Poincaré inequalities. 2. Answering an open question posed by Blanca et al., we give efficient identity-testers for mixtures of ATE distributions in the coordinate-conditional sampling access model. We also give some simplifications and improvements to the original algorithm of Blanca et al.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes