CV LGMay 7, 2020

Synthetic Image Augmentation for Damage Region Segmentation using Conditional GAN with Structure Edge

Takato Yasuno, Michihiro Nakajima, Tomoharu Sekiguchi, Kazuhiro Noda, Kiyoshi Aoyanagi, Sakura Kato

arXiv:2005.08628v12.34 citationsh-index: 5

Originality Synthesis-oriented

AI Analysis

This addresses the data scarcity and class imbalance problem in infrastructure inspection for maintenance engineers, though it is an incremental improvement using existing GAN methods on a specific domain.

The paper tackles the problem of damage region segmentation in bridge inspection images, where damage regions are extremely rare (0.6-1.5% of pixels), by proposing a synthetic augmentation method using conditional GANs with structure edges to generate additional damaged images. The result shows that retraining segmentation models (FCN-8s, SegNet, DeepLabv3+Xception-v2) with augmented data improves accuracy metrics like mean IoU, damage region IoU, precision, recall, and BF score.

Recently, social infrastructure is aging, and its predictive maintenance has become important issue. To monitor the state of infrastructures, bridge inspection is performed by human eye or bay drone. For diagnosis, primary damage region are recognized for repair targets. But, the degradation at worse level has rarely occurred, and the damage regions of interest are often narrow, so their ratio per image is extremely small pixel count, as experienced 0.6 to 1.5 percent. The both scarcity and imbalance property on the damage region of interest influences limited performance to detect damage. If additional data set of damaged images can be generated, it may enable to improve accuracy in damage region segmentation algorithm. We propose a synthetic augmentation procedure to generate damaged images using the image-to-image translation mapping from the tri-categorical label that consists the both semantic label and structure edge to the real damage image. We use the Sobel gradient operator to enhance structure edge. Actually, in case of bridge inspection, we apply the RC concrete structure with the number of 208 eye-inspection photos that rebar exposure have occurred, which are prepared 840 block images with size 224 by 224. We applied popular per-pixel segmentation algorithms such as the FCN-8s, SegNet, and DeepLabv3+Xception-v2. We demonstrates that re-training a data set added with synthetic augmentation procedure make higher accuracy based on indices the mean IoU, damage region of interest IoU, precision, recall, BF score when we predict test images.

View on arXiv PDF

Similar