LGMLNov 18, 2019

Comments on the Du-Kakade-Wang-Yang Lower Bounds

arXiv:1911.07910v146 citations
Originality Synthesis-oriented
AI Analysis

This work clarifies theoretical inconsistencies in reinforcement learning theory, which is important for researchers in machine learning and AI, but it is incremental as it synthesizes existing results.

The paper addresses conflicting results on the tractability of reinforcement learning with misspecified representations, comparing lower bounds on sample complexity by Du et al. with tractability findings based on the eluder dimension to reconcile interpretations.

Du, Kakade, Wang, and Yang recently established intriguing lower bounds on sample complexity, which suggest that reinforcement learning with a misspecified representation is intractable. Another line of work, which centers around a statistic called the eluder dimension, establishes tractability of problems similar to those considered in the Du-Kakade-Wang-Yang paper. We compare these results and reconcile interpretations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes