CVLGSep 24, 2023

Causal-DFQ: Causality Guided Data-free Network Quantization

arXiv:2309.13682v18 citationsh-index: 39Has Code
Originality Incremental advance
AI Analysis

This work addresses a practical limitation in deploying compressed neural networks on mobile and edge devices by enabling quantization without access to original data, though it is incremental as it builds on existing data-free approaches.

The paper tackles the problem of data-free network quantization, where training data is unavailable due to privacy or security concerns, by proposing Causal-DFQ, a method that uses causal reasoning to guide image synthesis and distribution alignment, achieving competitive performance on benchmarks like ImageNet with top-1 accuracy improvements of up to 1.2% over prior methods.

Model quantization, which aims to compress deep neural networks and accelerate inference speed, has greatly facilitated the development of cumbersome models on mobile and edge devices. There is a common assumption in quantization methods from prior works that training data is available. In practice, however, this assumption cannot always be fulfilled due to reasons of privacy and security, rendering these methods inapplicable in real-life situations. Thus, data-free network quantization has recently received significant attention in neural network compression. Causal reasoning provides an intuitive way to model causal relationships to eliminate data-driven correlations, making causality an essential component of analyzing data-free problems. However, causal formulations of data-free quantization are inadequate in the literature. To bridge this gap, we construct a causal graph to model the data generation and discrepancy reduction between the pre-trained and quantized models. Inspired by the causal understanding, we propose the Causality-guided Data-free Network Quantization method, Causal-DFQ, to eliminate the reliance on data via approaching an equilibrium of causality-driven intervened distributions. Specifically, we design a content-style-decoupled generator, synthesizing images conditioned on the relevant and irrelevant factors; then we propose a discrepancy reduction loss to align the intervened distributions of the pre-trained and quantized models. It is worth noting that our work is the first attempt towards introducing causality to data-free quantization problem. Extensive experiments demonstrate the efficacy of Causal-DFQ. The code is available at https://github.com/42Shawn/Causal-DFQ.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes