CVAINov 17, 2025

Automated Road Distress Detection Using Vision Transformersand Generative Adversarial Networks

arXiv:2511.13145v1
Originality Synthesis-oriented
AI Analysis

It addresses inefficient road maintenance for infrastructure managers by applying computer vision, but is incremental as it builds on existing methods.

This project tackled road distress segmentation by evaluating synthetic data from GANs and comparing CNN and MaskFormer models, finding that GAN data improves performance and MaskFormer outperforms CNN in mAP50 and IoU metrics.

The American Society of Civil Engineers has graded Americas infrastructure condition as a C, with the road system receiving a dismal D. Roads are vital to regional economic viability, yet their management, maintenance, and repair processes remain inefficient, relying on outdated manual or laser-based inspection methods that are both costly and time-consuming. With the increasing availability of real-time visual data from autonomous vehicles, there is an opportunity to apply computer vision (CV) methods for advanced road monitoring, providing insights to guide infrastructure rehabilitation efforts. This project explores the use of state-of-the-art CV techniques for road distress segmentation. It begins by evaluating synthetic data generated with Generative Adversarial Networks (GANs) to assess its usefulness for model training. The study then applies Convolutional Neural Networks (CNNs) for road distress segmentation and subsequently examines the transformer-based model MaskFormer. Results show that GAN-generated data improves model performance and that MaskFormer outperforms the CNN model in two metrics: mAP50 and IoU.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes