CVNov 17, 2024

Electrostatic Force Regularization for Neural Structured Pruning

arXiv:2411.11079v11 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the problem of complex and resource-intensive pruning for real-time applications on devices, though it appears incremental as it builds on existing structured pruning approaches.

The paper tackles the challenge of deploying deep convolutional neural networks on resource-constrained devices by proposing a novel structured pruning method that uses electrostatic force concepts to optimize weights and filter importance without architectural changes or extensive fine-tuning, achieving competitive performance on MNIST, CIFAR, and ImageNet datasets.

The demand for deploying deep convolutional neural networks (DCNNs) on resource-constrained devices for real-time applications remains substantial. However, existing state-of-the-art structured pruning methods often involve intricate implementations, require modifications to the original network architectures, and necessitate an extensive fine-tuning phase. To overcome these challenges, we propose a novel method that, for the first time, incorporates the concepts of charge and electrostatic force from physics into the training process of DCNNs. The magnitude of this force is directly proportional to the product of the charges of the convolution filter and the source filter, and inversely proportional to the square of the distance between them. We applied this electrostatic-like force to the convolution filters, either attracting filters with opposite charges toward non-zero weights or repelling filters with like charges toward zero weights. Consequently, filters subject to repulsive forces have their weights reduced to zero, enabling their removal, while the attractive forces preserve filters with significant weights that retain information. Unlike conventional methods, our approach is straightforward to implement, does not require any architectural modifications, and simultaneously optimizes weights and ranks filter importance, all without the need for extensive fine-tuning. We validated the efficacy of our method on modern DCNN architectures using the MNIST, CIFAR, and ImageNet datasets, achieving competitive performance compared to existing structured pruning approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes