CV | AI | Nov 8, 2025

GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks

arXiv:2511.05898v1 | h-index: 1
Originality: Incremental advance
AI Analysis

This paper addresses the loss of accuracy in low-bit quantized multi-task networks for computer vision. It is an incremental improvement that integrates modularly with existing quantization-aware training methods.

The paper tackles performance degradation in quantized multi-task networks by proposing GABFusion and ADA. It reports average mAP improvements of approximately 3.3% on PASCAL VOC and 1.6% on COCO, and narrows the accuracy gap between a 4-bit YOLOv5 model and its full-precision counterpart to 1.7% on VOC.

Despite the effectiveness of quantization-aware training (QAT) in compressing deep neural networks, its performance on multi-task architectures often degrades significantly due to task-specific feature discrepancies and gradient conflicts. To address these challenges, we propose Gradient-Aware Balanced Feature Fusion (GABFusion), which dynamically balances gradient magnitudes and fuses task-specific features in a quantization-friendly manner. We further introduce Attention Distribution Alignment (ADA), a feature-level distillation strategy tailored for quantized models. Our method demonstrates strong generalization across network architectures and QAT algorithms, with theoretical guarantees on gradient bias reduction. Extensive experiments demonstrate that our strategy consistently enhances a variety of QAT methods across different network architectures and bit-widths. On PASCAL VOC and COCO datasets, the proposed approach achieves average mAP improvements of approximately 3.3% and 1.6%, respectively. When applied to YOLOv5 under 4-bit quantization, our method narrows the accuracy gap with the full-precision model to only 1.7% on VOC, showcasing its effectiveness in preserving performance under low-bit constraints. Notably, the proposed framework is modular, easy to integrate, and compatible with any existing QAT technique, enhancing the performance of quantized models without requiring modifications to the original network architecture.
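The abstract does not specify which quantizer the authors use, so the following is only a generic sketch of what "4-bit quantization" means in a QAT setting. It is not GABFusion or ADA. It assumes a uniform, symmetric, per-tensor scheme with the scale taken from the largest absolute value; the function name `fake_quantize` and its parameters are hypothetical and chosen for illustration.

```python
def fake_quantize(values, num_bits=4):
    """Uniform symmetric fake quantization (illustrative assumption,
    not the paper's method): snap floats to a signed integer grid,
    then map them back to floats so training can continue in floating point.
    """
    qmax = 2 ** (num_bits - 1) - 1      # largest integer level, 7 for 4 bits
    qmin = -(2 ** (num_bits - 1))       # smallest integer level, -8 for 4 bits

    # Per-tensor scale from the largest magnitude; fall back to 1.0 for all zeros.
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0

    quantized = []
    for v in values:
        q = round(v / scale)            # nearest integer level
        q = max(qmin, min(qmax, q))     # clip to the representable range
        quantized.append(q * scale)     # de-quantize back to a float
    return quantized, scale


# Example: with 4 bits, every output value lies on a grid of at most 16 levels.
weights = [0.875, -0.5, 0.3, 0.0]
quantized_weights, scale = fake_quantize(weights, num_bits=4)
```

In this example the value 0.3 is snapped to 0.25, the nearest grid point. Rounding error of this kind is what grows as the bit-width shrinks, and it is the source of the accuracy gap to the full-precision model that the abstract reports.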

