LGCVSep 5, 2024

Unified Framework for Pre-trained Neural Network Compression via Decomposition and Optimized Rank Selection

arXiv:2409.03555v2h-index: 30
Originality Incremental advance
AI Analysis

This addresses model compression for deployment on resource-constrained devices, representing an incremental improvement to existing factorization methods.

The paper tackles the challenge of selecting appropriate rank for tensor decomposition in neural network compression by presenting a unified framework that simultaneously applies decomposition and rank selection with automatic rank search in continuous space. The method maintains performance of highly compressed models on par with original counterparts without needing additional training data.

Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource constrained devices such as mobile phones and embedded systems. Compression algorithms have been developed to address these challenges by reducing model size and computational demands while maintaining accuracy. Among these approaches, factorization methods based on tensor decomposition are theoretically sound and effective. However, they face difficulties in selecting the appropriate rank for decomposition. This paper tackles this issue by presenting a unified framework that simultaneously applies decomposition and rank selection, employing a composite compression loss within defined rank constraints. Our method includes an automatic rank search in a continuous space, efficiently identifying optimal rank configurations for the pre-trained model by eliminating the need for additional training data and reducing computational overhead in the search step. Combined with a subsequent fine-tuning step, our approach maintains the performance of highly compressed models on par with their original counterparts. Using various benchmark datasets and models, we demonstrate the efficacy of our method through a comprehensive analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes