CVMay 7, 2025

RAFT -- A Domain Adaptation Framework for RGB & LiDAR Semantic Segmentation

arXiv:2505.04529v41 citationsh-index: 30
Originality Incremental advance
AI Analysis

This work addresses the challenge of deploying segmentation models trained on synthetic data to real-world scenarios, offering incremental improvements for computer vision applications.

The paper tackles the Syn2Real problem in semantic segmentation by proposing RAFT, a domain adaptation framework that uses minimal labeled real-world data, achieving improvements in mIoU such as 2.1% on SYNTHIA->Cityscapes and 1.3% on Cityscapes->ACDC, surpassing the previous state-of-the-art HALO.

Image segmentation is a powerful computer vision technique for scene understanding. However, real-world deployment is stymied by the need for high-quality, meticulously labeled datasets. Synthetic data provides high-quality labels while reducing the need for manual data collection and annotation. However, deep neural networks trained on synthetic data often face the Syn2Real problem, leading to poor performance in real-world deployments. To mitigate the aforementioned gap in image segmentation, we propose RAFT, a novel framework for adapting image segmentation models using minimal labeled real-world data through data and feature augmentations, as well as active learning. To validate RAFT, we perform experiments on the synthetic-to-real "SYNTHIA->Cityscapes" and "GTAV->Cityscapes" benchmarks. We managed to surpass the previous state of the art, HALO. SYNTHIA->Cityscapes experiences an improvement in mIoU* upon domain adaptation of 2.1%/79.9%, and GTAV->Cityscapes experiences a 0.4%/78.2% improvement in mIoU. Furthermore, we test our approach on the real-to-real benchmark of "Cityscapes->ACDC", and again surpass HALO, with a gain in mIoU upon adaptation of 1.3%/73.2%. Finally, we examine the effect of the allocated annotation budget and various components of RAFT upon the final transfer mIoU.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes