ROCVLGDec 18, 2024

Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction

arXiv:2412.13419v16 citationsh-index: 2
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for autonomous driving safety, focusing on interpretable trajectory prediction models.

The paper tackled vehicle trajectory prediction by integrating Transformer and LSTM models to enhance spatial and temporal feature learning, but the results did not outperform predecessor LSTM-based methods, only demonstrating potential for interpretability.

Accurate vehicle trajectory prediction is crucial for ensuring safe and efficient autonomous driving. This work explores the integration of Transformer based model with Long Short-Term Memory (LSTM) based technique to enhance spatial and temporal feature learning in vehicle trajectory prediction. Here, a hybrid model that combines LSTMs for temporal encoding with a Transformer encoder for capturing complex interactions between vehicles is proposed. Spatial trajectory features of the neighboring vehicles are processed and goes through a masked scatter mechanism in a grid based environment, which is then combined with temporal trajectory of the vehicles. This combined trajectory data are learned by sequential LSTM encoding and Transformer based attention layers. The proposed model is benchmarked against predecessor LSTM based methods, including STA-LSTM, SA-LSTM, CS-LSTM, and NaiveLSTM. Our results, while not outperforming it's predecessor, demonstrate the potential of integrating Transformers with LSTM based technique to build interpretable trajectory prediction model. Future work will explore alternative architectures using Transformer applications to further enhance performance. This study provides a promising direction for improving trajectory prediction models by leveraging transformer based architectures, paving the way for more robust and interpretable vehicle trajectory prediction system.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes