LG AI MA MLFeb 15, 2020

Jelly Bean World: A Testbed for Never-Ending Learning

Emmanouil Antonios Platanios, Abulhair Saparov, Tom Mitchell

arXiv:2002.06306v112.827 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses the need for a standardized environment to facilitate research in never-ending learning and general intelligence, though it is incremental as it builds on existing paradigms without introducing new methods.

The authors tackled the lack of a testbed for never-ending learning systems by proposing Jelly Bean World, a freely-available software that provides complex, non-stationary environments for evaluating multi-task and multi-agent algorithms.

Machine learning has shown growing success in recent years. However, current machine learning systems are highly specialized, trained for particular problems or domains, and typically on a single narrow dataset. Human learning, on the other hand, is highly general and adaptable. Never-ending learning is a machine learning paradigm that aims to bridge this gap, with the goal of encouraging researchers to design machine learning systems that can learn to perform a wider variety of inter-related tasks in more complex environments. To date, there is no environment or testbed to facilitate the development and evaluation of never-ending learning systems. To this end, we propose the Jelly Bean World testbed. The Jelly Bean World allows experimentation over two-dimensional grid worlds which are filled with items and in which agents can navigate. This testbed provides environments that are sufficiently complex and where more generally intelligent algorithms ought to perform better than current state-of-the-art reinforcement learning approaches. It does so by producing non-stationary environments and facilitating experimentation with multi-task, multi-agent, multi-modal, and curriculum learning settings. We hope that this new freely-available software will prompt new research and interest in the development and evaluation of never-ending learning systems and more broadly, general intelligence systems.

View on arXiv PDF Code

Similar