CVJan 15, 2018

Deep Net Triage: Analyzing the Importance of Network Layers via Structural Compression

arXiv:1801.04651v25 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of interpreting deep network structures for researchers, but it is incremental as it builds on existing compression and fine-tuning techniques.

The paper tackles the problem of understanding the importance of layers in deep neural networks by analyzing their criticality through structural compression, finding that model accuracy does not depend on which layer is compressed and can be recovered or exceeded with fine-tuning.

Despite their prevalence, deep networks are poorly understood. This is due, at least in part, to their highly parameterized nature. As such, while certain structures have been found to work better than others, the significance of a model's unique structure, or the importance of a given layer, and how these translate to overall accuracy, remains unclear. In this paper, we analyze these properties of deep neural networks via a process we term deep net triage. Like medical triage---the assessment of the importance of various wounds---we assess the importance of layers in a neural network, or as we call it, their criticality. We do this by applying structural compression, whereby we reduce a block of layers to a single layer. After compressing a set of layers, we apply a combination of initialization and training schemes, and look at network accuracy, convergence, and the layer's learned filters to assess the criticality of the layer. We apply this analysis across four data sets of varying complexity. We find that the accuracy of the model does not depend on which layer was compressed; that accuracy can be recovered or exceeded after compression by fine-tuning across the entire model; and, lastly, that Knowledge Distillation can be used to hasten convergence of a compressed network, but constrains the accuracy attainable to that of the base model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes