stat.THStatistics

Statistics Theory

Mathematical statistics, asymptotic theory

12.6LGMay 15

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers

Bruno Trentini, Dejan Stancevic, Michael M. Bronstein et al.

For practitioners of flow-based generative models with limited inference compute, this work provides a principled, training-free scheduling method that consistently outperforms heuristic grids.

10.0MLMay 28

Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

Jingda Wu, Changxiao Cai

Provides the first rigorous theoretical justification for diffusion models' ability to adapt to low-dimensional structure and multi-modality, addressing a key gap in understanding their empirical success.

8.0STMay 28

Free Energy Universality in Tensor Estimation via Generic Chaining

Wenxuan Zou, Galen Reeves

This work provides a theoretical understanding of free energy universality for tensor estimation problems, extending prior results from matrix settings to a broader class of models and scaling regimes.

9.9STMar 28

Multiple-Prediction-Powered Inference

Charlie Cowen-Breen, Alekh Agarwal, Stephen Bates et al.

Provides a general framework for resource-constrained statistical estimation, improving efficiency for practitioners using multiple proxies.

11.9MLMay 13

What is Learnable in Valiant's Theory of the Learnable?

Steve Hanneke, Anay Mehrotra, Grigoris Velegkas et al.

Provides a theoretical characterization and first algorithm for a classic but understudied learning model, clarifying the role of membership queries.

10.8LGMay 16

Propagation of Chaos in Contextual Flow Maps

Shi Chen, Zhengjiang Lin, Kaizhao Liu et al.

Provides rigorous statistical guarantees for transformer performance as context length grows, addressing a key theoretical gap for practitioners scaling models.

12.2LGApr 19

Diverse Dictionary Learning

Yujia Zheng, Zijian Li, Shunxing Fan et al.

For practitioners in unsupervised learning, this provides a principled way to recover partial latent structure without unverifiable assumptions, though the results are theoretical and domain-agnostic.

11.7MLApr 14

Identifiability of Potentially Degenerate Gaussian Mixture Models With Piecewise Affine Mixing

Danru Xu, Sébastien Lachapelle, Sara Magliacane

For researchers in causal representation learning, this work extends identifiability guarantees to degenerate Gaussian mixtures, addressing a challenging setting where standard density-based methods fail.

10.5LGMay 8

Scaling Limits of Long-Context Transformers

Giuseppe Bruno, Shi Chen, Zhengjiang Lin et al.

For theorists studying transformer scaling, this provides precise phase transition boundaries and limiting laws, but the analysis is restricted to i.i.d. keys and fixed queries, limiting direct applicability.

9.2OCMay 16

High-dimensional Limit of SGD for Diagonal Linear Networks

Begoña García Malaxechebarría, Courtney Paquette, Maryam Fazel et al.

Provides a rigorous theoretical framework for understanding SGD dynamics in a simplified neural network setting, offering explicit non-asymptotic convergence guarantees.

14.0DSMar 24

Algorithmic warm starts for Hamiltonian Monte Carlo

Matthew S. Zhang, Jason M. Altschuler, Sinho Chewi

This resolves the computational bottleneck of finding warm starts for HMC, which is crucial for practitioners in statistics, engineering, and sciences who rely on HMC for high-dimensional sampling, though it is incremental as it builds on prior theoretical work.

9.8MLMay 14

Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models

Libin Zhu, Damek Davis, Dmitriy Drusvyatskiy et al.

This provides a theoretical explanation for the sample efficiency of iterative kernel methods like Recursive Feature Machines in learning low-dimensional structure from high-dimensional data.

12.5LGApr 12

Query Lower Bounds for Diffusion Sampling

Zhiyang Xun, Eric Price

Provides fundamental limits for diffusion sampling acceleration, relevant to researchers designing faster sampling algorithms.

7.6LGMay 28

Reasoning with Sampling: Cutting at Decision Points

Felix Zhou, Anay Mehrotra, Quanquan C. Liu

For practitioners seeking to improve reasoning in language models without additional training, this work offers a practical sampling method that outperforms prior approaches and RL-trained models.

11.7LGMay 17

Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space

Kelvin Kan, Xingjian Li, Benjamin J. Zhang et al.

This work provides the first convergence theory for discrete diffusion models that scales to large vocabularies (e.g., hundreds of thousands of tokens) by removing the state-space-size dependence that made prior bounds vacuous for modern language tasks.

7.8MLMay 24

Nyström Kernel Stein Discrepancy Tests

Florian Kalinke, Zoltán Szabó, Bharath K. Sriperumbudur

For practitioners needing scalable goodness-of-fit tests on large datasets, this work provides a theoretically grounded acceleration of KSD-based testing without sacrificing statistical performance.

19.5AIJun 3Code1

Hypothesis-Disciplined Multi-Agent Automated Formalization of Asymptotic Statistical Theory

Tingzhou Wei, Zeyu Zheng, Ethan X. Fang et al.

This work addresses the challenge of formalizing complex asymptotic statistical theory for the Lean 4 proof assistant, enabling verified mathematical reasoning in statistics.

9.8AIMay 7

Adaptive auditing of AI systems with anytime-valid guarantees

Siyu Zhou, Patrick Vossler, Venkatesh Sivaraman et al.

For AI auditors and regulators, this work enables statistically valid adaptive testing of AI systems without requiring pre-specified sampling rules, addressing a critical bottleneck in failure mode detection.

9.1ITMay 6

Information-theoretic Limits of Learning and Estimation

Abbas El Gamal, Maxim Raginsky

It serves as a pedagogical resource for students and researchers seeking to understand fundamental limits in machine learning and estimation theory.

8.6STMar 29

Learning general conditional independence structures via the neighbourhood lattice

Arash A. Amini, Bryon Aragam, Qing Zhou

This work addresses the problem of learning multivariate dependencies in nonparametric and high-dimensional settings, offering a unified approach that works without faithfulness and avoids the curse of dimensionality.