LGMar 2, 2025

TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

arXiv:2503.01013v342 citationsh-index: 25
Originality Incremental advance
AI Analysis

This work addresses the challenge of leveraging auxiliary modalities for more accurate and interpretable time series predictions, which is incremental as it builds on existing methods by incorporating LLMs.

The authors tackled the problem of multi-modal time series prediction by introducing TimeXL, a framework that integrates a prototype-based encoder with three collaborating LLMs, achieving up to 8.9% improvement in AUC on real-world datasets.

Time series analysis provides essential insights for real-world system dynamics and informs downstream decision-making, yet most existing methods often overlook the rich contextual signals present in auxiliary modalities. To bridge this gap, we introduce TimeXL, a multi-modal prediction framework that integrates a prototype-based time series encoder with three collaborating Large Language Models (LLMs) to deliver more accurate predictions and interpretable explanations. First, a multi-modal prototype-based encoder processes both time series and textual inputs to generate preliminary forecasts alongside case-based rationales. These outputs then feed into a prediction LLM, which refines the forecasts by reasoning over the encoder's predictions and explanations. Next, a reflection LLM compares the predicted values against the ground truth, identifying textual inconsistencies or noise. Guided by this feedback, a refinement LLM iteratively enhances text quality and triggers encoder retraining. This closed-loop workflow-prediction, critique (reflect), and refinement-continuously boosts the framework's performance and interpretability. Empirical evaluations on four real-world datasets demonstrate that TimeXL achieves up to 8.9% improvement in AUC and produces human-centric, multi-modal explanations, highlighting the power of LLM-driven reasoning for time series prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes