LGMay 20, 2025

Time to Embed: Unlocking Foundation Models for Time Series with Channel Descriptions

arXiv:2505.14543v11 citationsh-index: 11
Originality Incremental advance
AI Analysis

This addresses the need for scalable, domain-aware foundation models in time series analysis, moving beyond task-specific approaches, though it builds incrementally on existing Transformer and JEPA architectures.

The paper tackles the problem of developing foundation models for time series data, which have been under-explored compared to other domains like text and vision, by introducing CHARM, a 7M-parameter model that learns transferable representations and achieves state-of-the-art performance across diverse downstream tasks.

Traditional time series models are task-specific and often depend on dataset-specific training and extensive feature engineering. While Transformer-based architectures have improved scalability, foundation models, commonplace in text, vision, and audio, remain under-explored for time series and are largely restricted to forecasting. We introduce $\textbf{CHARM}$, a foundation embedding model for multivariate time series that learns shared, transferable, and domain-aware representations. To address the unique difficulties of time series foundation learning, $\textbf{CHARM}$ incorporates architectural innovations that integrate channel-level textual descriptions while remaining invariant to channel order. The model is trained using a Joint Embedding Predictive Architecture (JEPA), with novel augmentation schemes and a loss function designed to improve interpretability and training stability. Our $7$M-parameter model achieves state-of-the-art performance across diverse downstream tasks, setting a new benchmark for time series representation learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes