cond-mat.dis-nn Physics

Disordered Systems & Neural Networks

Neural network theory, spin glasses

99.7 ML Mar 13

A theory of learning data statistics in diffusion models, from easy to hard

Lorenzo Bardone, Claudia Merger, Sebastian Goldt

Provides foundational insight into how diffusion models learn complex distributions; the contribution is incremental but clarifies a key mechanism for the AI/ML community.

99.5 ML Jun 2

An Asymptotic Theory of Chain-of-Thought in In-Context Learning

Kaito Takanami, Cengiz Pehlevan

Provides a unified theoretical understanding of how test-time reasoning depth affects generalization in LLMs, addressing a poorly understood scaling behavior.

93.2 DIS-NN May 8

Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer

Clarissa Lauditi, Cengiz Pehlevan, Blake Bordelon

Provides a theoretical framework for understanding feature learning and hyperparameter transfer in wide neural networks, relevant to practitioners scaling up models.

92.4 DIS-NN Mar 24

Generative Inversion of Spectroscopic Data for Amorphous Structure Elucidation

Jiawei Guo, Daniel Schwalbe-Koda

This addresses the intricate challenge of structure elucidation in amorphous materials for materials scientists, offering a novel method that bypasses expert guidance or potentials, though it builds on existing generative techniques.

92.0 DIS-NN Apr 11

A Minimal Model of Representation Collapse: Frustration, Stop-Gradient, and Dynamics

Louie Hong Yao, Yuhao Li, Shengchao Liu

For researchers studying self-supervised learning, this work provides a theoretical understanding of collapse mechanisms and prevention, though the model is highly simplified.

91.8 DIS-NN May 12

The critical slowing down in diffusion models

Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau et al.

Provides theoretical insight into the limitations of diffusion models near criticality and demonstrates how architectural design can overcome these bottlenecks, relevant for statistical physics and generative modeling.

89.3 ML May 8

Emergence of Distortions in High-Dimensional Guided Diffusion Models

Enrico Ventura, Beatrice Achilli, Luca Ambrogioni et al.

For practitioners using CFG in diffusion models, this work provides a theoretical understanding of diversity loss and a practical fix, though the analysis is limited to Gaussian mixtures.

89.0 ML May 16

A Fourier perspective on the learning dynamics of neural networks: from sample complexities to mechanistic insights

Fabiola Ricci, Claudia Merger, Sebastian Goldt

For researchers studying learning dynamics and generalization in neural networks, this work provides a theoretical and experimental framework linking Fourier properties of data to sample complexity and training speed.

89.4 AO May 23

Memory Uncertainty Relation and Harmonic Memory in Random Recurrent Networks

Taichi Haruna, Kohei Nakajima

This work provides a theoretical framework for understanding memory limits in recurrent neural networks, relevant to reservoir computing and neuromorphic systems, though the results are primarily analytical with limited immediate practical impact.

90.0 ET Apr 18

A fully parallel densely connected probabilistic Ising machine with inertia for real-time applications

Ruomin Zhu, Abhishek Kumar Singh, Jérémie Laydevant et al.

This work solves a key bottleneck in probabilistic Ising machines, namely parallel updates, enabling faster solvers for dense optimization problems, with demonstrated utility in real-time wireless communications.

87.5 ML May 22

Asymmetric Scaling Laws from Sparse Features

John Sous, Michael Winer

This work provides a theoretical framework for understanding scaling laws in sparse neural networks, which is important for practitioners designing efficient models under compute constraints.

87.7 STR-EL May 13

Parallel Scan Recurrent Neural Quantum States for Scalable Variational Monte Carlo

Ejaaz Merali, Mohamed Hibat-Allah, Mohammad Kohandel et al.

This work makes recurrent neural network quantum states practical for large-scale quantum many-body simulations, addressing a scalability bottleneck.

83.4 ML May 20

Memorisation, convergence and generalisation in generative models

Antoine Maillard, Sebastian Goldt

This work clarifies the fundamental distinction between convergence and latent factor recovery in generative models, providing theoretical insights for practitioners regarding data requirements and evaluation metrics.

82.8 DIS-NN Mar 17

Optimality and annealing path planning of dynamical analog solvers

Shu Zhou, K. Y. Michael Wong, Juntao Wang et al.

This work addresses the lack of theoretical insight available to practitioners using analog solvers, though it is incremental in that it builds on existing dynamical systems approaches.

81.7 AI Mar 25

When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

Hidenori Tanaka

This provides a framework for understanding social representation formation in multi-agent systems, with implications for AI deployment in decision-making, though it is incremental in that it builds on prior naming-game studies.

79.4 ML May 11

Factual recall in linear associative memories: sharp asymptotics and mechanistic insights

Alessio Giorlandino, Sebastian Goldt, Antoine Maillard

This work establishes a fundamental baseline for understanding memory capacity in neural networks, relevant to researchers studying factual recall in large language models.

78.7 ML Apr 10

Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks

Jie Huang, Bruno Loureiro, Stefano Sarao Mannelli

This provides foundational insights into optimization challenges in neural networks, addressing a core problem for machine learning researchers.

78.8 DIS-NN Mar 31

Strong Low Degree Hardness for Stable Local Optima in Spin Glasses

Brice Huang, Mark Sellke

This work provides theoretical evidence for computational hardness in spin glass optimization, impacting physics and algorithm design, though it builds incrementally on prior conjectures.

76.6 DIS-NN Apr 6

Interpretation of Crystal Energy Landscapes with Kolmogorov-Arnold Networks

Gen Zu, Ning Mao, Claudia Felser et al.

This addresses the need for transparent, chemistry-based materials informatics to generate scientific insights beyond black-box predictions, representing a new paradigm rather than an incremental improvement.

76.9 ET May 31

Probabilistic Computers for MIMO Detection: From Sparsification to 2D Parallel Tempering

M Mahmudul Hasan Sajeeb, Kevin Callahan-Coray, Corentin Delacour et al.

This work addresses the dense connectivity bottleneck in probabilistic computers for real-world combinatorial optimization, offering a tuning-free algorithmic framework that could enable scalable hardware for next-generation wireless MIMO detection.