stat.MLStatistics

Machine Learning (Stats)

Statistical machine learning methods

21.8LGMar 16

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

Marc Finzi, Shikai Qiu, Yiding Jiang et al. · openai

This work addresses foundational issues in information theory for machine learning practitioners, offering a new framework for data selection and transformation, though it is incremental in building on existing concepts.

22.8LGMar 20

Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

Emiel Hoogeboom, David Ruhe, Jonathan Heek et al.

This addresses a bottleneck in discrete diffusion models for researchers and practitioners, enabling faster sampling while preserving performance.

19.1MLMar 22

Proximal Point Nash Learning from Human Feedback

Daniil Tiapkin, Daniele Calandriello, Denis Belomestny et al.

This addresses the challenge of accurately capturing complex human preferences in AI alignment, offering a more stable alternative to traditional methods, though it appears incremental as it builds on existing Nash learning frameworks.

40.1LGMay 13

TabPFN-3: Technical Report

Léo Grinsztajn, Klemens Flöge, Oscar Key et al.

For practitioners in science and industry needing fast, accurate tabular prediction, TabPFN-3 provides a foundation model that dominates the speed/performance frontier and scales to large datasets.

21.2MLMar 20Code11

Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects

Hao Wang, Licheng Pan, Qingsong Wen et al.

It offers a systematic overview for researchers in time-series forecasting, but is incremental as it synthesizes existing literature.

24.7CVMay 18Code

Improved Baselines with Representation Autoencoders

Jaskirat Singh, Boyang Zheng, Zongze Wu et al.

This work provides a practical, training-efficient improvement for generative modeling with diffusion transformers, relevant to researchers in image and video generation.

17.7LGMar 19Code

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks

Hao Wang, Licheng Pan, Zhichao Chen et al.

This addresses the scalability and cost issues in RLHF for aligning language models, though it is incremental by adapting causal methods to a specific domain.

13.0LGMay 8

RNE: plug-and-play diffusion inference-time control and energy-based training

Jiajun He, José Miguel Hernández-Lobato, Yuanqi Du et al. · cambridge

For practitioners using diffusion models, RNE offers a unified method for density estimation, inference-time control, and energy-based training, but the improvements are incremental over existing techniques.

17.4LGMar 16Code23

Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning

Ping Chen, Xiang Liu, Xingpeng Zhang et al.

This work addresses the problem of computational inefficiency in diffusion models for AI researchers and practitioners, offering a novel planning-based approach that is incremental in enhancing existing methods.

15.8LGMar 18

Learning to Reason with Curriculum I: Provable Benefits of Autocurriculum

Nived Rajaraman, Audrey Huang, Miro Dudik et al.

This addresses the problem of expensive training for reasoning models in AI, offering a method to reduce costs, though it is incremental as it builds on existing techniques like boosting.

14.6MLMay 25Code

DiscoverPhysics: Benchmarking LLMs for Out-of-the-Box Scientific Thinking

Matt L. Wiemann, Lindsay M. Smith, Peter Melchior et al.

For AI researchers evaluating LLM reasoning, this benchmark reveals that current models struggle with long-horizon experimental design and hypothesis revision, especially when latent variables are involved.

16.4MLApr 14

Discrete Flow Maps

Peter Potaptchik, Jason Yim, Adhi Saravanan et al.

For large language model practitioners, this provides a method to overcome the speed bottleneck of autoregressive generation while maintaining quality, though it is an incremental improvement over existing flow models.

13.4CLApr 9

Synthetic Data for any Differentiable Target

Tristan Thrush, Sung Min Park, Herman Brunborg et al.

This provides a flexible technique for shaping model properties using synthetic data, with potential applications in model customization and control, though it is incremental in advancing RL-based data generation methods.

17.1LGMar 27

Sharp Capacity Scaling of Spectral Optimizers in Learning Associative Memory

Juno Kim, Eshaan Nichani, Denny Wu et al.

This work provides a quantitative understanding of spectral optimizers for researchers in machine learning, though it is incremental as it builds on existing methods in a tractable model.

16.2AIMay 1

Position: agentic AI orchestration should be Bayes-consistent

Theodore Papamarkou, Pierre Alquier, Matthias Bauer et al.

For developers of agentic AI systems, this paper proposes a practical framework to enhance decision-making under uncertainty in orchestration, though it remains a position paper without empirical validation.

15.0LGMay 11

Muon is Not That Special: Random or Inverted Spectra Work Just as Well

Zakhar Shumaylov, Nathaël Da Costa, Peter Zaika et al.

For optimization researchers, this work demystifies the success of Muon, suggesting that geometric narratives may be overemphasized, though the findings are incremental in nature.

14.9LGApr 20

Discrete Tilt Matching

Yuyuan Chen, Shiyi Wang, Peter Potaptchik et al.

This work provides a practical RL fine-tuning method for masked diffusion LLMs, addressing a known bottleneck in training these models.

9.1MLMay 27Code11

DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants

Martin Andrae, Erik Wikingsson, So Takao et al.

For scientists and engineers performing data assimilation in complex dynamical systems, DAISI offers a flexible alternative to Gaussian-based methods without requiring retraining at each assimilation step.

16.6MLApr 23

There Will Be a Scientific Theory of Deep Learning

Jamie Simon, Daniel Kunin, Alexander Atanasov et al.

For the deep learning research community, this paper provides a unifying perspective on emerging theoretical work, but it is primarily a synthesis and roadmap rather than a novel contribution.

9.0LGMar 21Code2

Bayesian Scattering: A Principled Baseline for Uncertainty on Image Data

Bernardo Fichera, Zarko Ivkovic, Kjell Jorner et al.

This provides a principled baseline for uncertainty quantification in image data, addressing a gap in the field for interpretable methods, though it is incremental as it adapts existing scattering transforms to a Bayesian framework.