AICVJan 30, 2023

DepGraph: Towards Any Structural Pruning

arXiv:2301.12900v2496 citationsh-index: 67
Originality Incremental advance
AI Analysis

This work addresses the challenge of generalizing structural pruning to arbitrary architectures, which is crucial for efficient model deployment in various domains, though it builds incrementally on existing pruning techniques.

The authors tackled the problem of structural pruning across diverse neural network architectures by proposing DepGraph, a fully automatic method that models dependencies between layers to group coupled parameters, achieving consistent performance improvements across CNNs, RNNs, GNNs, and Transformers.

Structural pruning enables model acceleration by removing structurally-grouped parameters from neural networks. However, the parameter-grouping patterns vary widely across different models, making architecture-specific pruners, which rely on manually-designed grouping schemes, non-generalizable to new architectures. In this work, we study a highly-challenging yet barely-explored task, any structural pruning, to tackle general structural pruning of arbitrary architecture like CNNs, RNNs, GNNs and Transformers. The most prominent obstacle towards this goal lies in the structural coupling, which not only forces different layers to be pruned simultaneously, but also expects all removed parameters to be consistently unimportant, thereby avoiding structural issues and significant performance degradation after pruning. To address this problem, we propose a general and {fully automatic} method, \emph{Dependency Graph} (DepGraph), to explicitly model the dependency between layers and comprehensively group coupled parameters for pruning. In this work, we extensively evaluate our method on several architectures and tasks, including ResNe(X)t, DenseNet, MobileNet and Vision transformer for images, GAT for graph, DGCNN for 3D point cloud, alongside LSTM for language, and demonstrate that, even with a simple norm-based criterion, the proposed method consistently yields gratifying performances.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes