QUANT-PH AI LGNov 20, 2023

Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation of Self-Driving Cars

Akash Sinha, Antonio Macaluso, Matthias Klusch

arXiv:2311.12875v28.022 citationsh-index: 11Has Code

Originality Incremental advance

AI Analysis

This work addresses the computational inefficiency of deep reinforcement learning for self-driving cars by integrating quantum components, though it is incremental as it builds on existing quantum reinforcement learning approaches.

The authors tackled the collision-free navigation problem for self-driving cars by proposing Nav-Q, a quantum-supported deep reinforcement learning algorithm, which demonstrated improved training stability and sometimes faster convergence compared to classical methods in simulations using the CARLA benchmark.

The task of collision-free navigation (CFN) of self-driving cars is an NP-hard problem usually tackled using Deep Reinforcement Learning (DRL). While DRL methods have proven to be effective, their implementation requires substantial computing resources and extended training periods to develop a robust agent. On the other hand, quantum reinforcement learning has recently demonstrated faster convergence and improved stability in simple, non-real-world environments. In this work, we propose Nav-Q, the first quantum-supported DRL algorithm for CFN of self-driving cars, that leverages quantum computation for improving the training performance without the requirement for onboard quantum hardware. Nav-Q is based on the actor-critic approach, where the critic is implemented using a hybrid quantum-classical algorithm suitable for near-term quantum devices. We assess the performance of Nav-Q using the CARLA driving simulator, a de facto standard benchmark for evaluating state-of-the-art DRL methods. Our empirical evaluations showcase that Nav-Q surpasses its classical counterpart in terms of training stability and, in certain instances, with respect to the convergence rate. Furthermore, we assess Nav-Q in relation to effective dimension, unveiling that the incorporation of a quantum component results in a model with greater descriptive power compared to classical baselines. Finally, we evaluate the performance of Nav-Q using noisy quantum simulation, observing that the quantum noise deteriorates the training performances but enhances the exploratory tendencies of the agent during training.

View on arXiv PDF Code

Similar