CVApr 18

When W4A4 Breaks Camouflaged Object Detection: Token-Group Dual-Constraint Activation Quantization

arXiv:2604.1685558.7h-index: 3
AI Analysis

Enables low-bit (W4A4) inference for Transformer-based camouflaged object detection, a task critical for on-device deployment, by addressing a task-specific quantization bottleneck.

Camouflaged object detection (COD) suffers severe accuracy loss under 4-bit quantization due to heavy-tailed background tokens dominating activation ranges. The proposed COD-TDQ method achieves over 0.12 higher Sα score than prior quantization methods across four benchmarks without retraining.

Camouflaged object detection (COD) segments objects that intentionally blend with the background, so predictions depend on subtle texture and boundary cues. COD is often needed under tight on-device memory and latency budgets, making low-bit inference highly desirable. However, COD is unusually hard to quantify aggressively. We study post-training W4A4 quantization of Transformer-based COD and find a task-specific cliff: heavy-tailed background tokens dominate a shared activation range, inflating the step size and pushing weak-but-structured boundary cues into the zero bin. This exposes a token-local bottleneck -- remove cross-token range domination and bound the zero-bin mass under 4-bit activations. To address this, we introduce COD-TDQ, a COD-aware Token-group Dual-constraint activation Quantization method. COD-TDQ addresses this token-local bottleneck with two coupled steps: Direct-Sum Token-Group (DSTG) assigns token-group scales to suppress cross-token range domination, and Dual-Constraint Range Projection (DCRP) projects each token-group clip range to keep the step-to-dispersion ratio and the zero-bin mass bounded. Across four COD benchmarks and two baseline models (CFRN and ESCNet), COD-TDQ consistently achieves an Sαscore more than 0.12 higher than that of the state-of-the-art quantization method without retraining. The code will be released.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes