ROAILGJan 9, 2025

CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving

arXiv:2501.04982v16 citationsh-index: 1ICAART
Originality Incremental advance
AI Analysis

This addresses safety and generalization challenges in autonomous driving, but it is incremental as it builds on existing DRL and curriculum learning methods.

The paper tackles the problem of poor generalization and lack of transparency in deep reinforcement learning for autonomous driving by combining curriculum learning with PPO and VAE, resulting in improved adaptability and reliability in complex environments.

In autonomous driving, traditional Computer Vision (CV) agents often struggle in unfamiliar situations due to biases in the training data. Deep Reinforcement Learning (DRL) agents address this by learning from experience and maximizing rewards, which helps them adapt to dynamic environments. However, ensuring their generalization remains challenging, especially with static training environments. Additionally, DRL models lack transparency, making it difficult to guarantee safety in all scenarios, particularly those not seen during training. To tackle these issues, we propose a method that combines DRL with Curriculum Learning for autonomous driving. Our approach uses a Proximal Policy Optimization (PPO) agent and a Variational Autoencoder (VAE) to learn safe driving in the CARLA simulator. The agent is trained using two-fold curriculum learning, progressively increasing environment difficulty and incorporating a collision penalty in the reward function to promote safety. This method improves the agent's adaptability and reliability in complex environments, and understand the nuances of balancing multiple reward components from different feedback signals in a single scalar reward function. Keywords: Computer Vision, Deep Reinforcement Learning, Variational Autoencoder, Proximal Policy Optimization, Curriculum Learning, Autonomous Driving.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes