CVAIMay 24

Multiscale Real-Time Object Detection in the NMS-Free Era: A Comparative Performance Evaluation of YOLOv8 and YOLO26

arXiv:2605.2483124.8
Predicted impact top 88% in CV · last 90 daysOriginality Synthesis-oriented
AI Analysis

For practitioners selecting real-time object detectors, this work provides a comparative evaluation highlighting that NMS-free design does not guarantee universal superiority and that choice depends on dataset, scale, and hardware.

This paper compares YOLOv8 (NMS-based) and YOLO26 (NMS-free) on Pascal VOC and VisDrone datasets across multiple scales. YOLO26 achieves stronger detection performance and lower model complexity on Pascal VOC, while the gap narrows on VisDrone where both struggle with dense small targets.

Non-Maximum Suppression (NMS) remains a key post-processing step in many real-time object detection pipelines, but it can introduce latency variation and deployment complexity in resource-constrained settings. Recent NMS-free designs such as YOLO26 aim to reduce this dependence through end-to-end detection, yet their performance relative to established NMS-based models such as YOLOv8 remains underexplored beyond standard benchmarks. This paper compares YOLOv8 and YOLO26 on Pascal VOC and VisDrone, representing general object detection and dense aerial small-object detection, respectively. Both model families are evaluated across five scales using accuracy, localization, model size, GFLOPs, and CPU/GPU latency. Results show that YOLO26 achieves stronger detection performance and lower model complexity on Pascal VOC across most scales, while the performance gap narrows on VisDrone, where both models struggle with dense small targets. YOLOv8 remains competitive in GPU latency, showing that NMS-free design does not guarantee universal deployment superiority. Overall, the study shows that detector selection depends on dataset characteristics, object scale, model capacity, and hardware constraints.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes