Cross-Modal Reconstruction Pretraining for Ramp Flow Prediction at Highway Interchanges
This addresses traffic prediction challenges for highway management by overcoming detector scarcity, though it is incremental as it builds on existing forecasting methods.
The paper tackled the problem of predicting ramp flows at highway interchanges without real-time ramp detectors by proposing a cross-modal reconstruction pretraining framework, achieving performance comparable to models using historical ramp data and outperforming thirteen state-of-the-art baselines on three real-world datasets.
Interchanges are crucial nodes for vehicle transfers between highways, yet the lack of real-time ramp detectors creates blind spots in traffic prediction. To address this, we propose a Spatio-Temporal Decoupled Autoencoder (STDAE), a two-stage framework that leverages cross-modal reconstruction pretraining. In the first stage, STDAE reconstructs historical ramp flows from mainline data, forcing the model to capture intrinsic spatio-temporal relations. Its decoupled architecture with parallel spatial and temporal autoencoders efficiently extracts heterogeneous features. In the prediction stage, the learned representations are integrated with models such as GWNet to enhance accuracy. Experiments on three real-world interchange datasets show that STDAE-GWNET consistently outperforms thirteen state-of-the-art baselines and achieves performance comparable to models using historical ramp data. This demonstrates its effectiveness in overcoming detector scarcity and its plug-and-play potential for diverse forecasting pipelines.