CVROJul 24, 2024

Context-aware Multi-task Learning for Pedestrian Intent and Trajectory Prediction

arXiv:2407.17162v17 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses a critical safety challenge for autonomous vehicles in urban environments by improving pedestrian behavior modeling, though it is incremental as it builds on existing multi-task learning approaches.

The paper tackles the problem of predicting pedestrian trajectory and intention for autonomous vehicles by introducing PTINet, a multi-task learning model that integrates past trajectories with local and global contextual features, achieving superior performance on JAAD and PIE datasets over state-of-the-art methods.

The advancement of socially-aware autonomous vehicles hinges on precise modeling of human behavior. Within this broad paradigm, the specific challenge lies in accurately predicting pedestrian's trajectory and intention. Traditional methodologies have leaned heavily on historical trajectory data, frequently overlooking vital contextual cues such as pedestrian-specific traits and environmental factors. Furthermore, there's a notable knowledge gap as trajectory and intention prediction have largely been approached as separate problems, despite their mutual dependence. To bridge this gap, we introduce PTINet (Pedestrian Trajectory and Intention Prediction Network), which jointly learns the trajectory and intention prediction by combining past trajectory observations, local contextual features (individual pedestrian behaviors), and global features (signs, markings etc.). The efficacy of our approach is evaluated on widely used public datasets: JAAD and PIE, where it has demonstrated superior performance over existing state-of-the-art models in trajectory and intention prediction. The results from our experiments and ablation studies robustly validate PTINet's effectiveness in jointly exploring intention and trajectory prediction for pedestrian behaviour modelling. The experimental evaluation indicates the advantage of using global and local contextual features for pedestrian trajectory and intention prediction. The effectiveness of PTINet in predicting pedestrian behavior paves the way for the development of automated systems capable of seamlessly interacting with pedestrians in urban settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes