CVJan 28, 2025

B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning

arXiv:2501.16917v12 citationsh-index: 6Has Code2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW)
Originality Incremental advance
AI Analysis

This work addresses the need for efficient face detection in applications like mobile or embedded systems, but it is incremental as it builds on existing pruning techniques.

The paper tackles the problem of creating lightweight face detection models for resource-constrained devices by proposing B-FPGM, a pruning pipeline that combines FPGM, SFP, and Bayesian optimization, resulting in superior trade-offs between model size and performance across the WIDER FACE dataset.

Face detection is a computer vision application that increasingly demands lightweight models to facilitate deployment on devices with limited computational resources. Neural network pruning is a promising technique that can effectively reduce network size without significantly affecting performance. In this work, we propose a novel face detection pruning pipeline that leverages Filter Pruning via Geometric Median (FPGM) pruning, Soft Filter Pruning (SFP) and Bayesian optimization in order to achieve a superior trade-off between size and performance compared to existing approaches. FPGM pruning is a structured pruning technique that allows pruning the least significant filters in each layer, while SFP iteratively prunes the filters and allows them to be updated in any subsequent training step. Bayesian optimization is employed in order to optimize the pruning rates of each layer, rather than relying on engineering expertise to determine the optimal pruning rates for each layer. In our experiments across all three subsets of the WIDER FACE dataset, our proposed approach B-FPGM consistently outperforms existing ones in balancing model size and performance. All our experiments were applied to EResFD, the currently smallest (in number of parameters) well-performing face detector of the literature; a small ablation study with a second small face detector, EXTD, is also reported. The source code and trained pruned face detection models can be found at: https://github.com/IDTITI/B-FPGM.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes