CVMar 3, 2020

Anytime Inference with Distilled Hierarchical Neural Ensembles

arXiv:2003.01474v317 citations
AI Analysis

This addresses the need for flexible inference in scenarios with varying compute or data, offering an incremental improvement over previous anytime inference models.

The paper tackles the problem of computationally expensive inference in deep neural networks by proposing Hierarchical Neural Ensembles (HNE) for anytime inference, achieving state-of-the-art accuracy-compute trade-offs on CIFAR-10/100 and ImageNet datasets.

Inference in deep neural networks can be computationally expensive, and networks capable of anytime inference are important in mscenarios where the amount of compute or quantity of input data varies over time. In such networks the inference process can interrupted to provide a result faster, or continued to obtain a more accurate result. We propose Hierarchical Neural Ensembles (HNE), a novel framework to embed an ensemble of multiple networks in a hierarchical tree structure, sharing intermediate layers. In HNE we control the complexity of inference on-the-fly by evaluating more or less models in the ensemble. Our second contribution is a novel hierarchical distillation method to boost the prediction accuracy of small ensembles. This approach leverages the nested structure of our ensembles, to optimally allocate accuracy and diversity across the individual models. Our experiments show that, compared to previous anytime inference models, HNE provides state-of-the-art accuracy-computate trade-offs on the CIFAR-10/100 and ImageNet datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes