Thermal-GEMs: Generalized Models for Building Thermal Dynamics

Felix Koch, Fabian Raisch, Benjamin Tischler

arXiv:2604.1644380.01 citationsh-index: 2

AI Analysis

For practitioners in building energy efficiency, this provides practical guidance on selecting modeling strategies based on available source building data.

This paper evaluates multi-source transfer learning (TL) and time series foundation models (TSFMs) for building thermal dynamics, finding that multi-source TL reduces forecasting errors by up to 63% compared to single-source TL, but requires 16-32 source buildings with over 1 year of data to consistently outperform TSFMs.

Data-driven models for building thermal dynamics are a scalable approach for enabling energy-efficient operation through fault detection & diagnosis or advanced control. To obtain accurate models, measurement data from a target building spanning months to years are required. Transfer Learning (TL) mitigates this challenge by employing pretrained models based on single or multiple source buildings. General multi-source TL models promise to outperform single-source TL, but alternative multi-source modeling architectures remain to be explored, and evaluation on real-world data is missing. Moreover, time series foundation models (TSFM) have emerged as candidates for the best-performing general models. Hence, we conduct a first, comprehensive assessment of general modeling approaches for building thermal dynamics, including multi-source TL and TSFMs. Our assessment includes ablations using four state-of-the-art multi-source TL architectures and evaluations on synthetic as well as real-world data. We demonstrate that multi-source TL models are highly effective in accurately modeling buildings in real-world applications, yielding up to 63% lower forecasting errors compared to single-source TL. Moreover, our results suggest a trade-off between multi-source TL models exclusively pretrained with building data and TSFMs pretrained with a multitude of different time series, revealing that data from 16-32 source buildings must be available over 1 year for pretraining multi-source TL models to consistently outperform TSFMs as evaluated using the mean absolute error. These findings provide practical guidance for selecting modeling strategies based on the number of source buildings available for pretraining multi-source TL models.

View on arXiv PDF

Similar