CVMay 27, 2021

YOLO5Face: Why Reinventing a Face Detector

arXiv:2105.12931v3229 citationsHas Code
Originality Incremental advance
AI Analysis

This work provides an efficient face detection solution for applications on embedded or mobile devices, but it is incremental as it builds on existing YOLOv5 with targeted optimizations.

The authors tackled face detection by treating it as a generic object detection task, implementing YOLO5Face based on YOLOv5 with modifications like a five-point landmark regression head, and achieved state-of-the-art performance on the WiderFace dataset across Easy, Medium, and Hard subsets.

Tremendous progress has been made on face detection in recent years using convolutional neural networks. While many face detectors use designs designated for detecting faces, we treat face detection as a generic object detection task. We implement a face detector based on the YOLOv5 object detector and call it YOLO5Face. We make a few key modifications to the YOLOv5 and optimize it for face detection. These modifications include adding a five-point landmark regression head, using a stem block at the input of the backbone, using smaller-size kernels in the SPP, and adding a P6 output in the PAN block. We design detectors of different model sizes, from an extra-large model to achieve the best performance to a super small model for real-time detection on an embedded or mobile device. Experiment results on the WiderFace dataset show that on VGA images, our face detectors can achieve state-of-the-art performance in almost all the Easy, Medium, and Hard subsets, exceeding the more complex designated face detectors. The code is available at \url{https://github.com/deepcam-cn/yolov5-face}

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes