100.0STMar 29
Learning general conditional independence structures via the neighbourhood latticeArash A. Amini, Bryon Aragam, Qing Zhou
This work addresses the problem of learning multivariate dependencies in nonparametric and high-dimensional settings, offering a unified approach that works without faithfulness and avoids the curse of dimensionality.
99.7DSMar 24
Algorithmic warm starts for Hamiltonian Monte CarloMatthew S. Zhang, Jason M. Altschuler, Sinho Chewi
This resolves the computational bottleneck of finding warm starts for HMC, which is crucial for practitioners in statistics, engineering, and sciences who rely on HMC for high-dimensional sampling, though it is incremental as it builds on prior theoretical work.
98.7MLMay 13
What is Learnable in Valiant's Theory of the Learnable?Steve Hanneke, Anay Mehrotra, Grigoris Velegkas et al.
Provides a theoretical characterization and first algorithm for a classic but understudied learning model, clarifying the role of membership queries.
98.8STMay 29
Bayesian Inference with Shaped Deep Non-linear MLPsBoris Hanin, Tianze Jiang
This work provides theoretical insights into the behavior of deep non-linear MLPs for researchers and practitioners interested in understanding the benefits of network depth in Bayesian inference, particularly in large-scale regimes.
97.5STMar 28
Multiple-Prediction-Powered InferenceCharlie Cowen-Breen, Alekh Agarwal, Stephen Bates et al.
Provides a general framework for resource-constrained statistical estimation, improving efficiency for practitioners using multiple proxies.
96.3STMar 23
Stable Algorithms Lower Bounds for EstimationXifan Yu, Ilias Zadik
This work provides rigorous algorithmic foundations for the physics belief that first-order phase transitions impose fundamental limits on efficient algorithms, addressing a long-standing problem in theoretical computer science and statistical estimation.
97.5QUANT-PHApr 16
Cloning is as Hard as Learning for Stabilizer StatesNikhil Bansal, Matthias C. Caro, Gaurav Mahajan
It establishes a fundamental equivalence between cloning and learning for an important class of quantum states, providing a fine-grained perspective on No-Cloning theorems with implications for quantum learning theory and cryptography.
100.0PRJun 3
A remark on the majorizing measures theorem for general processesReese Pathak, Nikita Zhivotovskiy · eth-zurich
This extends a fundamental inequality from Gaussian processes to a much larger family, providing a unified theoretical foundation for empirical process theory.
97.1MLMay 28
Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix CompletionAnay Mehrotra, Phuc Tran, Van H. Vu et al.
For researchers in causal inference and matrix completion, this work offers the first sharp row-wise ℓ2 perturbation bound, enabling per-unit treatment effect estimation where previous methods only gave average bounds.
96.7MLMay 28
Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal DistributionsJingda Wu, Changxiao Cai
Provides the first rigorous theoretical justification for diffusion models' ability to adapt to low-dimensional structure and multi-modality, addressing a key gap in understanding their empirical success.
95.8LGMay 17
Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right SpaceKelvin Kan, Xingjian Li, Benjamin J. Zhang et al.
This work provides the first convergence theory for discrete diffusion models that scales to large vocabularies (e.g., hundreds of thousands of tokens) by removing the state-space-size dependence that made prior bounds vacuous for modern language tasks.
95.7LGApr 12
Query Lower Bounds for Diffusion SamplingZhiyang Xun, Eric Price
Provides fundamental limits for diffusion sampling acceleration, relevant to researchers designing faster sampling algorithms.
95.1STMay 26Code
Robust Moment-Based Estimation via Spectral Gradient ReweightingLiu Zhang, Amit Singer
For practitioners using moment-based estimation in the presence of outliers, this work provides a theoretically grounded robust method with finite-sample guarantees.
94.5MLMay 30
Taming the Loss Landscape of PINNs with Noisy Feynman-Kac Supervision: Operator Preconditioning and Non-Asymptotic Error BoundsNathanael Tepakbong, Hanyu Hu, Chengyu Liu et al.
For researchers using PINNs to solve PDEs, this work provides a theoretically grounded method to mitigate training difficulties caused by ill-conditioned loss landscapes, with practical improvements on several benchmark problems.
98.8PRMar 31
Shifted Composition IV: Toward Ballistic Acceleration for Log-Concave SamplingJason M. Altschuler, Sinho Chewi, Matthew S. Zhang
This work addresses a major open problem in computational statistics and machine learning by enabling faster sampling algorithms for log-concave distributions, which is incremental but crucial for applications like Bayesian inference.
95.2DSMar 26
The Geometry of Efficient Nonconvex SamplingSantosh S. Vempala, Andre Wibisono
This provides a substantial generalization of sampling methods for convex and star-shaped bodies, addressing a fundamental challenge in computational geometry and statistics.
94.4MLApr 14
Identifiability of Potentially Degenerate Gaussian Mixture Models With Piecewise Affine MixingDanru Xu, Sébastien Lachapelle, Sara Magliacane
For researchers in causal representation learning, this work extends identifiability guarantees to degenerate Gaussian mixtures, addressing a challenging setting where standard density-based methods fail.
94.2MLMay 21
Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting ModelsKrishnakumar Balasubramanian
This work provides theoretical convergence guarantees for a new class of generative models, addressing conservatism issues in drifting methods, but the rates are limited by curse of dimensionality.
93.9MLMar 24
Wasserstein Parallel Transport for Predicting the Dynamics of Statistical SystemsTristan Luca Saidi, Gonzalo Mena, Larry Wasserman et al.
This addresses the challenge of modeling distributional dynamics in scientific systems where classical vector space methods fail, offering a novel approach for researchers in fields like biology and economics.
95.0OCMay 16
High-dimensional Limit of SGD for Diagonal Linear NetworksBegoña García Malaxechebarría, Courtney Paquette, Maryam Fazel et al.
Provides a rigorous theoretical framework for understanding SGD dynamics in a simplified neural network setting, offering explicit non-asymptotic convergence guarantees.