CLLGJan 3, 2025

Time Series Language Model for Descriptive Caption Generation

arXiv:2501.01832v110 citationsh-index: 2Eng appl artif intell
Originality Highly original
AI Analysis

This addresses the data scarcity issue in time series captioning, enhancing interpretability and utility for domains relying on temporal data analysis.

The paper tackles the problem of generating natural language descriptions for time series data, which is under-explored in large language models, by introducing TSLM, a novel encoder-decoder model that outperforms existing state-of-the-art approaches by a significant margin.

The automatic generation of representative natural language descriptions for observable patterns in time series data enhances interpretability, simplifies analysis and increases cross-domain utility of temporal data. While pre-trained foundation models have made considerable progress in natural language processing (NLP) and computer vision (CV), their application to time series analysis has been hindered by data scarcity. Although several large language model (LLM)-based methods have been proposed for time series forecasting, time series captioning is under-explored in the context of LLMs. In this paper, we introduce TSLM, a novel time series language model designed specifically for time series captioning. TSLM operates as an encoder-decoder model, leveraging both text prompts and time series data representations to capture subtle temporal patterns across multiple phases and generate precise textual descriptions of time series inputs. TSLM addresses the data scarcity problem in time series captioning by first leveraging an in-context prompting synthetic data generation, and second denoising the generated data via a novel cross-modal dense retrieval scoring applied to time series-caption pairs. Experimental findings on various time series captioning datasets demonstrate that TSLM outperforms existing state-of-the-art approaches from multiple data modalities by a significant margin.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes