LG MLJul 3, 2018

On the Computational Power of Online Gradient Descent

Vaggos Chatziafratis, Tim Roughgarden, Joshua R. Wang

arXiv:1807.01280v24.14 citations

Originality Highly original

AI Analysis

This work addresses the fundamental computational limits of analyzing gradient descent for researchers in machine learning theory and optimization.

The paper demonstrates that online gradient descent can simulate arbitrary polynomial-space computations, even in simple learning settings, and shows that efficiently analyzing its fine-grained behavior is impossible under weak complexity-theoretic assumptions.

We prove that the evolution of weight vectors in online gradient descent can encode arbitrary polynomial-space computations, even in very simple learning settings. Our results imply that, under weak complexity-theoretic assumptions, it is impossible to reason efficiently about the fine-grained behavior of online gradient descent.

View on arXiv PDF

Similar