CVOct 31, 2025

MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series

arXiv:2510.27547v1h-index: 29
Originality Incremental advance
AI Analysis

This work addresses the problem of constructing linked spatio-temporal datasets from historical maps for applications like dating buildings and analyzing environmental changes, representing an incremental advancement by adapting existing models to a specific domain.

The authors tackled the challenge of automated segmentation of historical map images and time series by developing MapSAM2, a framework that adapts a visual foundation model to treat these as videos, resulting in improved geometric accuracy and effective learning of temporal associations with limited supervision.

Historical maps are unique and valuable archives that document geographic features across different time periods. However, automated analysis of historical map images remains a significant challenge due to their wide stylistic variability and the scarcity of annotated training data. Constructing linked spatio-temporal datasets from historical map time series is even more time-consuming and labor-intensive, as it requires synthesizing information from multiple maps. Such datasets are essential for applications such as dating buildings, analyzing the development of road networks and settlements, studying environmental changes etc. We present MapSAM2, a unified framework for automatically segmenting both historical map images and time series. Built on a visual foundation model, MapSAM2 adapts to diverse segmentation tasks with few-shot fine-tuning. Our key innovation is to treat both historical map images and time series as videos. For images, we process a set of tiles as a video, enabling the memory attention mechanism to incorporate contextual cues from similar tiles, leading to improved geometric accuracy, particularly for areal features. For time series, we introduce the annotated Siegfried Building Time Series Dataset and, to reduce annotation costs, propose generating pseudo time series from single-year maps by simulating common temporal transformations. Experimental results show that MapSAM2 learns temporal associations effectively and can accurately segment and link buildings in time series under limited supervision or using pseudo videos. We will release both our dataset and code to support future research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes