CVSPNov 16, 2021

SEnSeI: A Deep Learning Module for Creating Sensor Independent Cloud Masks

arXiv:2111.08349v110 citations
Originality Highly original
AI Analysis

This addresses the remote sensing community's challenge of fragmented labeling efforts across diverse sensors, enabling more efficient and widely applicable deep learning models for tasks like on-board processing.

The paper tackles the problem of creating sensor-independent cloud masks for multispectral satellite imagery by introducing SEnSeI, a neural network architecture that generalizes across different sensors. It achieves state-of-the-art performance on trained sensors like Sentinel-2 and Landsat 8 and successfully extrapolates to unseen sensors such as Landsat 7, PerúSat-1, and Sentinel-3 SLSTR, with performance improving when multiple satellites are used in training.

We introduce a novel neural network architecture -- Spectral ENcoder for SEnsor Independence (SEnSeI) -- by which several multispectral instruments, each with different combinations of spectral bands, can be used to train a generalised deep learning model. We focus on the problem of cloud masking, using several pre-existing datasets, and a new, freely available dataset for Sentinel-2. Our model is shown to achieve state-of-the-art performance on the satellites it was trained on (Sentinel-2 and Landsat 8), and is able to extrapolate to sensors it has not seen during training such as Landsat 7, PerúSat-1, and Sentinel-3 SLSTR. Model performance is shown to improve when multiple satellites are used in training, approaching or surpassing the performance of specialised, single-sensor models. This work is motivated by the fact that the remote sensing community has access to data taken with a hugely variety of sensors. This has inevitably led to labelling efforts being undertaken separately for different sensors, which limits the performance of deep learning models, given their need for huge training sets to perform optimally. Sensor independence can enable deep learning models to utilise multiple datasets for training simultaneously, boosting performance and making them much more widely applicable. This may lead to deep learning approaches being used more frequently for on-board applications and in ground segment data processing, which generally require models to be ready at launch or soon afterwards.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes