Enhancing Traffic Prediction with Textual Data Using Large Language Models
This addresses the problem of integrating non-numerical data for more accurate traffic prediction in transportation systems, representing an incremental improvement over existing methods.
This study tackled the challenge of incorporating textual contextual information like weather into traffic prediction models by using large language models to process text into embeddings, which were then combined with historical traffic data in traditional spatiotemporal models. The approach improved prediction accuracy on the New York Bike dataset, though specific numerical gains were not provided.
Traffic prediction is pivotal for rational transportation supply scheduling and allocation. Existing researches into short-term traffic prediction, however, face challenges in adequately addressing exceptional circumstances and integrating non-numerical contextual information like weather into models. While, Large language models offer a promising solution due to their inherent world knowledge. However, directly using them for traffic prediction presents drawbacks such as high cost, lack of determinism, and limited mathematical capability. To mitigate these issues, this study proposes a novel approach. Instead of directly employing large models for prediction, it utilizes them to process textual information and obtain embeddings. These embeddings are then combined with historical traffic data and inputted into traditional spatiotemporal forecasting models. The study investigates two types of special scenarios: regional-level and node-level. For regional-level scenarios, textual information is represented as a node connected to the entire network. For node-level scenarios, embeddings from the large model represent additional nodes connected only to corresponding nodes. This approach shows a significant improvement in prediction accuracy according to our experiment of New York Bike dataset.