LGAIROAug 1, 2023

PeRP: Personalized Residual Policies For Congestion Mitigation Through Co-operative Advisory Systems

arXiv:2308.00864v26 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses traffic congestion for human drivers in mixed-autonomy scenarios, representing an incremental improvement over existing piecewise constant policies.

The paper tackles the problem of traffic congestion mitigation by developing a cooperative advisory system that provides personalized driving recommendations to human drivers, achieving 4-22% improvement in average speed over baselines.

Intelligent driving systems can be used to mitigate congestion through simple actions, thus improving many socioeconomic factors such as commute time and gas costs. However, these systems assume precise control over autonomous vehicle fleets, and are hence limited in practice as they fail to account for uncertainty in human behavior. Piecewise Constant (PC) Policies address these issues by structurally modeling the likeness of human driving to reduce traffic congestion in dense scenarios to provide action advice to be followed by human drivers. However, PC policies assume that all drivers behave similarly. To this end, we develop a co-operative advisory system based on PC policies with a novel driver trait conditioned Personalized Residual Policy, PeRP. PeRP advises drivers to behave in ways that mitigate traffic congestion. We first infer the driver's intrinsic traits on how they follow instructions in an unsupervised manner with a variational autoencoder. Then, a policy conditioned on the inferred trait adapts the action of the PC policy to provide the driver with a personalized recommendation. Our system is trained in simulation with novel driver modeling of instruction adherence. We show that our approach successfully mitigates congestion while adapting to different driver behaviors, with 4 to 22% improvement in average speed over baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes