Multi-Modal Sensor Fusion for Proactive Blockage Prediction in mmWave Vehicular Networks
This addresses the challenge of maintaining reliable communication in dynamic vehicular environments, though it appears incremental as it builds on existing multi-modal sensing methods.
The paper tackles the problem of signal blockage in mmWave vehicular networks by proposing a proactive blockage prediction framework using multi-modal sensor fusion, achieving up to 97.2% F1-score with inference times as low as 89.8ms for predictions up to 1.5 seconds in advance.
Vehicular communication systems operating in the millimeter wave (mmWave) band are highly susceptible to signal blockage from dynamic obstacles such as vehicles, pedestrians, and infrastructure. To address this challenge, we propose a proactive blockage prediction framework that utilizes multi-modal sensing, including camera, GPS, LiDAR, and radar inputs in an infrastructure-to-vehicle (I2V) setting. This approach uses modality-specific deep learning models to process each sensor stream independently and fuses their outputs using a softmax-weighted ensemble strategy based on validation performance. Our evaluations, for up to 1.5s in advance, show that the camera-only model achieves the best standalone trade-off with an F1-score of 97.1% and an inference time of 89.8ms. A camera+radar configuration further improves accuracy to 97.2% F1 at 95.7ms. Our results display the effectiveness and efficiency of multi-modal sensing for mmWave blockage prediction and provide a pathway for proactive wireless communication in dynamic environments.