CVFeb 22

A Two-Stage Detection-Tracking Framework for Stable Apple Quality Inspection in Dense Conveyor-Belt Environments

arXiv:2602.19278v1h-index: 4
Originality Synthesis-oriented
AI Analysis

This addresses the need for reliable automated fruit grading systems in industrial settings, though it is incremental as it integrates existing methods like YOLOv8 and ByteTrack for a specific application.

The paper tackled the problem of unstable apple quality inspection in dense conveyor-belt environments by proposing a two-stage detection-tracking framework, resulting in improved stability compared to frame-wise inference with metrics like track-level defect ratio and temporal consistency.

Industrial fruit inspection systems must operate reliably under dense multi-object interactions and continuous motion, yet most existing works evaluate detection or classification at the image level without ensuring temporal stability in video streams. We present a two-stage detection-tracking framework for stable multi-apple quality inspection in conveyor-belt environments. An orchard-trained YOLOv8 model performs apple localization, followed by ByteTrack multi-object tracking to maintain persistent identities. A ResNet18 defect classifier, fine-tuned on a healthy-defective fruit dataset, is applied to cropped apple regions. Track-level aggregation is introduced to enforce temporal consistency and reduce prediction oscillation across frames. We define video-level industrial metrics such as track-level defect ratio and temporal consistency to evaluate system robustness under realistic processing conditions. Results demonstrate improved stability compared to frame-wise inference, suggesting that integrating tracking is essential for practical automated fruit grading systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes