100.0OCMar 23
Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy GamesClément Hongler, Franck Gabriel, Valentin Hartmann et al.
This work addresses the open problem of developing general AI capabilities through automated curriculum learning, potentially impacting the entire field of artificial intelligence if successful.
99.8OCApr 2
Optimal Projection-Free Adaptive SGD for Matrix OptimizationDmitry Kovalev
This work addresses incremental improvements in optimization algorithms for machine learning, specifically for matrix optimization problems.
99.8SYMar 27
Stabilizing a linear system using phone calls when time is informationMohammad Javad Khojasteh, Massimo Franceschetti, Gireeja Ranade
This work addresses the challenge of event-triggering control with zero-payload rate for systems like phone call networks, generalizing previous strategies and revealing fundamental limits in using timing information for stabilization.
99.6OCMar 19
Leader-following Consensus over Jointly Connected Switching Networks is Achievable for Exponentially Unstable Linear SystemsYuhan Chen, Tao Liu, Jie Huang
This solves a long-standing challenge in control theory for multi-agent systems, enabling applications like robotics and autonomous vehicles with unstable dynamics, though it is a foundational breakthrough rather than incremental.
99.4OCMar 21
SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample ComplexityTianshu Chu, Dachuan Xu, Wei Yao et al.
This resolves a theoretical gap in machine learning optimization, providing efficient algorithms for large-scale nested problems, though it is incremental as it adapts an existing method to a specific setting.
99.3OCApr 28Code
From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization ModelingJianghao Lin, Zi Ling, Chenyu Zhou et al.
For practitioners needing reliable optimization modeling from natural language, Agora-Opt provides a practical and extensible foundation that improves over existing methods.
99.7SYMar 24
Universal Formula Families for Safe Stabilization of Single-Input Nonlinear SystemsBo Wang, Miroslav Krstic
This work provides explicit constructive alternatives to CLF-CBF quadratic programming for scalar-input nonlinear systems, which is incremental as it builds on existing universal stabilizer formulas.
99.1OCMay 29
Diffusion-Robust Optimization over GraphsLiviu Aolaritei, Ricky Huang, Michael I. Jordan et al.
This work provides insights into the computational landscape of robust graph optimization under a topology-aware uncertainty model, which is relevant for researchers and practitioners in networked systems like transportation and logistics.
98.9OCApr 1
Implicit Primal-Dual Interior-Point Methods for Quadratic ProgrammingJon Arrizabalaga, Zachary Manchester
This addresses a key limitation in solving large-scale quadratic programs to high precision, offering potential improvements in computational efficiency and accuracy.
100.0DSMay 30
Continuous Data Assimilation with Learned Surrogate DynamicsWenwen Li, Daniel Sanz-Alonso
For practitioners of data assimilation, this work provides theoretical guarantees for using machine learning surrogates in nudging algorithms, addressing model error when true dynamics are unknown or expensive.
98.7OCMay 18
Symmetry-Compatible Principle for Optimizer Design: Embeddings, LM Heads, SwiGLU MLPs, and MoE RoutersTim Tsz-Kit Lau, Weijie Su
This work addresses the geometric disparity between neural network symmetries and coordinate-wise optimizers, offering a principled framework for designing layerwise optimizers that improve training of large language models.
99.2SYMay 1
Distributed Coordination of Grid-Forming and Grid-Following Inverters for Optimal Frequency Control in Power SystemsXiaoyang Wang, Xin Chen
For power system operators, this work provides a scalable, distributed solution to frequency control in inverter-dominated grids, addressing a critical challenge in renewable integration.
99.0SYApr 14
Symmetry Is Almost All You Need: Robust Stability with Uncertainty Induced by Symmetric SRG RegionsDing Zhang, Di Zhao, Philipp Braun et al.
Provides theoretical foundations for robust stability analysis of MIMO LTI systems under symmetric uncertainties, offering a unified framework that generalizes existing conditions.
98.5OCApr 28
On Finding Small Hyper-Gradients in Bilevel Optimization: Hardness Results and Improved AnalysisLesi Chen, Jing Xu, Jingzhao Zhang
For researchers in bilevel optimization, this work clarifies the boundary between tractable and intractable problems and provides optimal algorithms for a practically relevant class of nonconvex-nonconvex problems.
97.7COMay 10
Stable Set Polytopes with Rank $|V(G)|/3$ for the Lovász--Schrijver SDP OperatorYu Hin Au, Levent Tunçel
For researchers in combinatorial optimization and semidefinite programming, this resolves a long-standing open problem about the exact size of graphs achieving a given LS_+ rank.
98.9NAMar 17
The peak heat flux conjecture for the first Dirichlet eigenmode of convex planar domainsZijian Wang, Jeremy G. Hoskins, Manas Rachh et al.
This addresses a theoretical problem in mathematical physics for researchers studying heat diffusion and spectral geometry, with incremental analytical support for the conjecture.
98.3OCMay 18
Scale-Invariant Neural Network Optimization: Norm Geometry and Heavy-Tailed NoiseJiayu Zhang, Tianyi Lin
For researchers designing optimizers for deep neural networks, this work provides theoretical foundations and practical algorithms for scale-invariant methods under realistic heavy-tailed noise.
98.4DSMar 26
Global Stability Analysis of the Age-Structured Chemostat With Substrate DynamicsIasson Karafyllis, Dionysios Theodosis, Miroslav Krstic
This work addresses stability in complex biological systems like chemostats, providing rigorous mathematical tools for researchers in mathematical biology and control theory, though it is incremental as it builds on existing models with a new analytical approach.
97.4NIMay 9
Locational Pricing for Generative-AI Services via Token-Flow Market ClearingShaohui Liu
This work addresses the emerging problem of efficient dispatch and pricing for geographically distributed AI service infrastructure, providing a foundational market-clearing framework for future competitive markets.
98.1OCMay 13
Proximal-Based Generative Modeling for Bayesian Inverse ProblemsBoyang Zhang, Zhiguo Wang, Ya-Feng Liu
This work provides a novel solution to the likelihood score intractability in score-based diffusion models for inverse problems, benefiting applications in imaging and scientific computing.