CVNov 18, 2024

Transmission Line Defect Detection Based on UAV Patrol Images and Vision-language Pretraining

arXiv:2411.11370v2h-index: 7
Originality Incremental advance
AI Analysis

This work addresses a critical task in transmission line monitoring for utility companies, but it is incremental as it builds on existing vision-language pretraining techniques tailored to a specific domain.

The paper tackles the problem of detecting defects in transmission lines from UAV patrol images, which often lack sufficient visual information due to imaging conditions, and proposes a method based on vision-language pretraining and progressive transfer, resulting in significantly improved detection accuracy.

Unmanned aerial vehicle (UAV) patrol inspection has emerged as a predominant approach in transmission line monitoring owing to its cost-effectiveness. Detecting defects in transmission lines is a critical task during UAV patrol inspection. However, due to imaging distance and shooting angles, UAV patrol images often suffer from insufficient defect-related visual information, which has an adverse effect on detection accuracy. In this article, we propose a novel method for detecting defects in UAV patrol images, which is based on vision-language pretraining for transmission line (VLP-TL) and a progressive transfer strategy (PTS). Specifically, VLP-TL contains two novel pretraining tasks tailored for the transmission line scenario, aimimg at pretraining an image encoder with abundant knowledge acquired from both visual and linguistic information. Transferring the pretrained image encoder to the defect detector as its backbone can effectively alleviate the insufficient visual information problem. In addition, the PTS further improves transfer performance by progressively bridging the gap between pretraining and downstream defection detection. Experimental results demonstrate that the proposed method significantly improves defect detection accuracy by jointly utilizing multimodal information, overcoming the limitations of insufficient defect-related visual information provided by UAV patrol images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes