CV | Oct 22, 2025

Mitigating representation bias caused by missing pixels in methane plume detection

Julia Wąsala, Joannes D. Maasakkers, Ilse Aben, Rochelle Schneider, Holger Hoos, Mitra Baratchi

arXiv:2510.19478v1 | 3.6 | h-index: 39

Originality: Synthesis-oriented

AI Analysis

This work addresses a specific bias issue in environmental monitoring that is relevant to researchers and practitioners, but it is incremental in that it applies known techniques to a new domain.

The paper tackles representation bias caused by missing pixels in methane plume detection from satellite images. It shows that imputation and weighted resampling reduce this bias without harming accuracy, and that the debiased models detect plumes better in low-coverage images.

Most satellite images have systematically missing pixels, i.e., data that are missing not at random (MNAR), due to factors such as clouds. If not addressed, these missing pixels can lead to representation bias in automated feature extraction models. In this work, we show that a spurious association between the label and the number of missing values in methane plume detection can cause the model to associate the coverage (i.e., the percentage of valid pixels in an image) with the label, and consequently to under-detect plumes in low-coverage images. We evaluate several imputation approaches to remove the dependence between the coverage and the label. Additionally, we propose a weighted resampling scheme during training that removes the association between the label and the coverage by enforcing class balance in each coverage bin. Our results show that both resampling and imputation can significantly reduce the representation bias without hurting balanced accuracy, precision, or recall. Finally, we evaluate the models debiased with these techniques in an operational scenario and demonstrate that they have a higher chance of detecting plumes in low-coverage images.
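
The idea of enforcing class balance within each coverage bin can be illustrated with a minimal sketch. The abstract does not specify the exact weighting scheme, so everything below is an assumption made for illustration: the function names `coverage` and `bin_balanced_weights`, the use of NaN to mark missing pixels, equal-width bins, and the choice to keep each bin's share of the data unchanged while splitting that share equally between the classes present in the bin.

```python
import numpy as np


def coverage(images):
    """Fraction of valid pixels per image.

    Assumes `images` has shape (n, height, width) and that missing
    pixels are encoded as NaN (an assumption, not from the paper).
    """
    images = np.asarray(images, dtype=float)
    return 1.0 - np.isnan(images).reshape(len(images), -1).mean(axis=1)


def bin_balanced_weights(cov, labels, n_bins=5):
    """Sampling probabilities that balance the classes inside each coverage bin.

    Each bin keeps its original share of the total probability mass, and
    that share is split equally between the classes present in the bin.
    A bin containing a single class cannot be balanced and is left as is.
    """
    cov = np.asarray(cov, dtype=float)
    labels = np.asarray(labels)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Use interior edges only, so that coverage 1.0 lands in the last bin.
    bins = np.digitize(cov, edges[1:-1])

    weights = np.zeros(len(labels), dtype=float)
    for b in np.unique(bins):
        in_bin = bins == b
        present = np.unique(labels[in_bin])
        for c in present:
            mask = in_bin & (labels == c)
            # Bin size divided evenly over classes, then over class members.
            weights[mask] = in_bin.sum() / (len(present) * mask.sum())
    return weights / weights.sum()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cov = np.array([0.10, 0.10, 0.10, 0.15, 0.90, 0.90, 0.95, 0.95])
    labels = np.array([0, 0, 0, 1, 1, 1, 1, 0])  # 1 = plume, 0 = no plume
    p = bin_balanced_weights(cov, labels, n_bins=2)
    # Draw a resampled training set (with replacement) using these probabilities.
    idx = rng.choice(len(labels), size=1000, p=p)
    print(labels[idx].mean())
```

Under this weighting, the expected class proportions are equal in every coverage bin that contains both classes, so coverage alone carries no information about the label in the resampled data. The same probabilities could be handed to any weighted sampler in a training pipeline.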
