CVIMLGApr 20, 2022

Hephaestus: A large scale multitask dataset towards InSAR understanding

arXiv:2204.09435v120 citationsh-index: 22
Originality Synthesis-oriented
AI Analysis

This addresses the barrier for researchers in remote sensing and geophysics to apply deep learning to InSAR data, though it is incremental as it provides a dataset rather than a new method.

The authors tackled the lack of large annotated datasets for InSAR data by creating Hephaestus, a manually annotated dataset with 19,919 interferograms and 216,106 patches from 44 volcanoes, enabling various computer vision tasks.

Synthetic Aperture Radar (SAR) data and Interferometric SAR (InSAR) products in particular, are one of the largest sources of Earth Observation data. InSAR provides unique information on diverse geophysical processes and geology, and on the geotechnical properties of man-made structures. However, there are only a limited number of applications that exploit the abundance of InSAR data and deep learning methods to extract such knowledge. The main barrier has been the lack of a large curated and annotated InSAR dataset, which would be costly to create and would require an interdisciplinary team of experts experienced on InSAR data interpretation. In this work, we put the effort to create and make available the first of its kind, manually annotated dataset that consists of 19,919 individual Sentinel-1 interferograms acquired over 44 different volcanoes globally, which are split into 216,106 InSAR patches. The annotated dataset is designed to address different computer vision problems, including volcano state classification, semantic segmentation of ground deformation, detection and classification of atmospheric signals in InSAR imagery, interferogram captioning, text to InSAR generation, and InSAR image quality assessment.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes