Unsupervised model-free representation learning
This addresses the challenge of unsupervised representation learning for control and learning problems with limited feedback, though it appears incremental as it builds on existing information theory concepts.
The paper tackles the problem of learning representations from high-dimensional, dependent data without feedback by maximizing a time-series information criterion, showing that the optimal function preserves original dependencies and is unique and consistent.
Numerous control and learning problems face the situation where sequences of high-dimensional highly dependent data are available but no or little feedback is provided to the learner, which makes any inference rather challenging. To address this challenge, we formulate the following problem. Given a series of observations $X_0,\dots,X_n$ coming from a large (high-dimensional) space $\mathcal X$, find a representation function $f$ mapping $\mathcal X$ to a finite space $\mathcal Y$ such that the series $f(X_0),\dots,f(X_n)$ preserves as much information as possible about the original time-series dependence in $X_0,\dots,X_n$. We show that, for stationary time series, the function $f$ can be selected as the one maximizing a certain information criterion that we call time-series information. Some properties of this functions are investigated, including its uniqueness and consistency of its empirical estimates. Implications for the problem of optimal control are presented.