CV AIJan 31, 2024

MOD-CL: Multi-label Object Detection with Constrained Loss

Sota Moriyama, Koji Watanabe, Katsumi Inoue, Akihiro Takemura

arXiv:2403.07885v13.73 citationsh-index: 3Has Code

Originality Synthesis-oriented

AI Analysis

This is an incremental improvement for multi-label object detection systems, addressing output constraint satisfaction.

The paper tackles multi-label object detection by introducing MOD-CL, a framework that uses constrained loss to better satisfy output requirements, showing improvements in scores for both Task 1 (with Corrector and Blender models) and Task 2 (with Product T-Norm integration).

We introduce MOD-CL, a multi-label object detection framework that utilizes constrained loss in the training process to produce outputs that better satisfy the given requirements. In this paper, we use $\mathrm{MOD_{YOLO}}$, a multi-label object detection model built upon the state-of-the-art object detection model YOLOv8, which has been published in recent years. In Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate a more constrained output. For Task 2, constrained losses have been incorporated into the $\mathrm{MOD_{YOLO}}$ architecture using Product T-Norm. The results show that these implementations are instrumental to improving the scores for both Task 1 and Task 2.

View on arXiv PDF Code

Similar