LGMLMar 31, 2022

Learning from many trajectories

arXiv:2203.17193v233 citations
Originality Incremental advance
AI Analysis

This addresses a theoretical gap in statistical learning for sequence modeling and control, providing insights into efficiency in multi-trajectory settings, though it is incremental as it builds on existing learning theory.

The paper tackles the problem of supervised learning from multiple independent sequences of non-independent covariates, establishing that for linear least-squares regression, the worst-case error rate changes from Ω(n²/m²T) when m ≲ n to Θ(n/mT) when m ≳ n, showing that with many trajectories, error behaves as if examples were independent.

We initiate a study of supervised learning from many independent sequences ("trajectories") of non-independent covariates, reflecting tasks in sequence modeling, control, and reinforcement learning. Conceptually, our multi-trajectory setup sits between two traditional settings in statistical learning theory: learning from independent examples and learning from a single auto-correlated sequence. Our conditions for efficient learning generalize the former setting--trajectories must be non-degenerate in ways that extend standard requirements for independent examples. Notably, we do not require that trajectories be ergodic, long, nor strictly stable. For linear least-squares regression, given $n$-dimensional examples produced by $m$ trajectories, each of length $T$, we observe a notable change in statistical efficiency as the number of trajectories increases from a few (namely $m \lesssim n$) to many (namely $m \gtrsim n$). Specifically, we establish that the worst-case error rate of this problem is $Θ(n / m T)$ whenever $m \gtrsim n$. Meanwhile, when $m \lesssim n$, we establish a (sharp) lower bound of $Ω(n^2 / m^2 T)$ on the worst-case error rate, realized by a simple, marginally unstable linear dynamical system. A key upshot is that, in domains where trajectories regularly reset, the error rate eventually behaves as if all of the examples were independent, drawn from their marginals. As a corollary of our analysis, we also improve guarantees for the linear system identification problem.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes