LG HEP-THJan 24, 2023

Autonomous particles

arXiv:2301.10077v16.64 citationsh-index: 19

Originality Incremental advance

AI Analysis

This approach could improve efficiency in autonomous driving and similar RL tasks by reducing the learning complexity, though it appears incremental as it builds on symmetry-based methods.

The paper tackles the problem of reinforcement learning in high-dimensional environments with limited actions by proposing to identify relevant information through system symmetries rather than brute-force learning, demonstrating that autonomous vehicles can learn collision-free driving using only four relevant invariants.

Consider a reinforcement learning problem where an agent has access to a very large amount of information about the environment, but it can only take very few actions to accomplish its task and to maximize its reward. Evidently, the main problem for the agent is to learn a map from a very high-dimensional space (which represents its environment) to a very low-dimensional space (which represents its actions). The high-to-low dimensional map implies that most of the information about the environment is irrelevant for the actions to be taken, and only a small fraction of information is relevant. In this paper we argue that the relevant information need not be learned by brute force (which is the standard approach), but can be identified from the intrinsic symmetries of the system. We analyze in details a reinforcement learning problem of autonomous driving, where the corresponding symmetry is the Galilean symmetry, and argue that the learning task can be accomplished with very few relevant parameters, or, more precisely, invariants. For a numerical demonstration, we show that the autonomous vehicles (which we call autonomous particles since they describe very primitive vehicles) need only four relevant invariants to learn how to drive very well without colliding with other particles. The simple model can be easily generalized to include different types of particles (e.g. for cars, for pedestrians, for buildings, for road signs, etc.) with different types of relevant invariants describing interactions between them. We also argue that there must exist a field theory description of the learning system where autonomous particles would be described by fermionic degrees of freedom and interactions mediated by the relevant invariants would be described by bosonic degrees of freedom.

View on arXiv PDF

Similar