CVAIMar 14, 2024

Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection

arXiv:2403.09313v116 citationsHas Code
AI Analysis

This work addresses model efficiency and accuracy for underwater object detection, which is incremental as it combines existing techniques like knowledge distillation and transformers in a specific domain.

The paper tackles the problem of object detection in underwater robotics using side-scan sonar images by proposing YOLOX-ViT and applying knowledge distillation to reduce model size while maintaining performance, with results showing reduced false positives in wall detection and improved accuracy due to a visual transformer layer.

In this paper we present YOLOX-ViT, a novel object detection model, and investigate the efficacy of knowledge distillation for model size reduction without sacrificing performance. Focused on underwater robotics, our research addresses key questions about the viability of smaller models and the impact of the visual transformer layer in YOLOX. Furthermore, we introduce a new side-scan sonar image dataset, and use it to evaluate our object detector's performance. Results show that knowledge distillation effectively reduces false positives in wall detection. Additionally, the introduced visual transformer layer significantly improves object detection accuracy in the underwater environment. The source code of the knowledge distillation in the YOLOX-ViT is at https://github.com/remaro-network/KD-YOLOX-ViT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes