CVMay 27, 2023

Condition-Invariant Semantic Segmentation

Christos Sakaridis, David Bruggemann, Fisher Yu, Luc Van Gool

arXiv:2305.17349v48.418 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses robust perception for autonomous cars and robots by improving condition-level adaptation, though it builds incrementally on existing domain adaptation methods.

The paper tackles the problem of adapting semantic segmentation networks to different visual conditions, such as daytime-to-nighttime, by proposing a feature invariance loss that aligns features from original and stylized images, achieving state-of-the-art results on benchmarks like Cityscapes→Dark Zurich and competitive performance on Cityscapes→ACDC.

Adaptation of semantic segmentation networks to different visual conditions is vital for robust perception in autonomous cars and robots. However, previous work has shown that most feature-level adaptation methods, which employ adversarial training and are validated on synthetic-to-real adaptation, provide marginal gains in condition-level adaptation, being outperformed by simple pixel-level adaptation via stylization. Motivated by these findings, we propose to leverage stylization in performing feature-level adaptation by aligning the internal network features extracted by the encoder of the network from the original and the stylized view of each input image with a novel feature invariance loss. In this way, we encourage the encoder to extract features that are already invariant to the style of the input, allowing the decoder to focus on parsing these features and not on further abstracting from the specific style of the input. We implement our method, named Condition-Invariant Semantic Segmentation (CISS), on the current state-of-the-art domain adaptation architecture and achieve outstanding results on condition-level adaptation. In particular, CISS sets the new state of the art in the popular daytime-to-nighttime Cityscapes$\to$Dark Zurich benchmark. Furthermore, our method achieves the second-best performance on the normal-to-adverse Cityscapes$\to$ACDC benchmark. CISS is shown to generalize well to domains unseen during training, such as BDD100K-night and ACDC-night. Code is publicly available at https://github.com/SysCV/CISS .

View on arXiv PDF Code

Similar