FlowDistill: Scalable Traffic Flow Prediction via Distillation from LLMs
This addresses the challenge of deploying accurate traffic prediction in data-scarce or resource-limited urban settings, offering a scalable solution with incremental improvements in efficiency.
The paper tackles the problem of traffic flow prediction in resource-constrained environments by proposing FlowDistill, a lightweight framework that uses knowledge distillation from LLMs to guide a compact MLP model. It achieves higher prediction accuracy than state-of-the-art models while requiring less training data and reducing memory usage and inference latency.
Accurate traffic flow prediction is vital for optimizing urban mobility, yet it remains difficult in many cities due to complex spatio-temporal dependencies and limited high-quality data. While deep graph-based models demonstrate strong predictive power, their performance often comes at the cost of high computational overhead and substantial training data requirements, making them impractical for deployment in resource-constrained or data-scarce environments. We propose the FlowDistill, a lightweight and scalable traffic prediction framework based on knowledge distillation from large language models (LLMs). In this teacher-student setup, a fine-tuned LLM guides a compact multi-layer perceptron (MLP) student model using a novel combination of the information bottleneck principle and teacher-bounded regression loss, ensuring the distilled model retains only essential and transferable knowledge. Spatial and temporal correlations are explicitly encoded to enhance the model's generalization across diverse urban settings. Despite its simplicity, FlowDistill consistently outperforms state-of-the-art models in prediction accuracy while requiring significantly less training data, and achieving lower memory usage and inference latency, highlighting its efficiency and suitability for real-world, scalable deployment.