CVSep 6, 2022

A Multitask Deep Learning Model for Parsing Bridge Elements and Segmenting Defect in Bridge Inspection Images

Chenyu Zhang, Muhammad Monjurul Karim, Ruwen Qin

arXiv:2209.02190v23.719 citationsh-index: 24Has Code

Originality Incremental advance

AI Analysis

This addresses the high cost of manual bridge inspections for infrastructure maintenance, though it appears incremental as it builds on existing deep learning approaches.

The paper tackles the problem of automating bridge inspection by developing a multitask deep learning model that simultaneously parses bridge elements and segments defects, achieving 2.59% higher mIoU on element parsing and 1.65% on corrosion segmentation compared to single-task models.

The vast network of bridges in the United States raises a high requirement for maintenance and rehabilitation. The massive cost of manual visual inspection to assess bridge conditions is a burden to some extent. Advanced robots have been leveraged to automate inspection data collection. Automating the segmentations of multiclass elements and surface defects on the elements in the large volume of inspection image data would facilitate an efficient and effective assessment of the bridge condition. Training separate single-task networks for element parsing (i.e., semantic segmentation of multiclass elements) and defect segmentation fails to incorporate the close connection between these two tasks. Both recognizable structural elements and apparent surface defects are present in the inspection images. This paper is motivated to develop a multitask deep learning model that fully utilizes such interdependence between bridge elements and defects to boost the model's task performance and generalization. Furthermore, the study investigated the effectiveness of the proposed model designs for improving task performance, including feature decomposition, cross-talk sharing, and multi-objective loss function. A dataset with pixel-level labels of bridge elements and corrosion was developed for model training and testing. Quantitative and qualitative results from evaluating the developed multitask deep model demonstrate its advantages over the single-task-based model not only in performance (2.59% higher mIoU on bridge parsing and 1.65% on corrosion segmentation) but also in computational time and implementation capability.

View on arXiv PDF Code

Similar