CVJul 16, 2025

InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization

arXiv:2507.12420v23 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses localization accuracy issues in object detection, particularly for small objects, but is incremental as it builds on existing IoU-based loss methods.

The paper tackles the problem of suboptimal bounding box regression in object detection by proposing InterpIoU, a loss function that uses interpolated boxes to provide meaningful gradients and avoid issues like box enlargement, resulting in consistent performance improvements across datasets like COCO, VisDrone, and PASCAL VOC, with notable gains in small object detection.

Bounding box regression (BBR) is fundamental to object detection, where the regression loss is crucial for accurate localization. Existing IoU-based losses often incorporate handcrafted geometric penalties to address IoU's non-differentiability in non-overlapping cases and enhance BBR performance. However, these penalties are sensitive to box shape, size, and distribution, often leading to suboptimal optimization for small objects and undesired behaviors such as bounding box enlargement due to misalignment with the IoU objective. To address these limitations, we propose InterpIoU, a novel loss function that replaces handcrafted geometric penalties with a term based on the IoU between interpolated boxes and the target. By using interpolated boxes to bridge the gap between predictions and ground truth, InterpIoU provides meaningful gradients in non-overlapping cases and inherently avoids the box enlargement issue caused by misaligned penalties. Simulation results further show that IoU itself serves as an ideal regression target, while existing geometric penalties are both unnecessary and suboptimal. Building on InterpIoU, we introduce Dynamic InterpIoU, which dynamically adjusts interpolation coefficients based on IoU values, enhancing adaptability to scenarios with diverse object distributions. Experiments on COCO, VisDrone, and PASCAL VOC show that our methods consistently outperform state-of-the-art IoU-based losses across various detection frameworks, with particularly notable improvements in small object detection, confirming their effectiveness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes