LGSep 27, 2021

Consistency Training of Multi-exit Architectures for Sensor Data

arXiv:2109.13192v11 citations
Originality Incremental advance
AI Analysis

This work addresses computational and energy constraints for deploying deep learning models on resource-limited devices, representing an incremental improvement in training techniques for multi-exit architectures.

The paper tackles the problem of computational inefficiency in deep neural networks for sensor data by introducing a consistency training method for multi-exit architectures, resulting in earlier exits with improved detection rates without executing all layers.

Deep neural networks have become larger over the years with increasing demand of computational resources for inference; incurring exacerbate costs and leaving little room for deployment on devices with limited battery and other resources for real-time applications. The multi-exit architectures are type of deep neural network that are interleaved with several output (or exit) layers at varying depths of the model. They provide a sound approach for improving computational time and energy utilization of running a model through producing predictions from early exits. In this work, we present a novel and architecture-agnostic approach for robust training of multi-exit architectures termed consistent exit training. The crux of the method lies in a consistency-based objective to enforce prediction invariance over clean and perturbed inputs. We leverage weak supervision to align model output with consistency training and jointly optimize dual-losses in a multi-task learning fashion over the exits in a network. Our technique enables exit layers to generalize better when confronted with increasing uncertainty, hence, resulting in superior quality-efficiency trade-offs. We demonstrate through extensive evaluation on challenging learning tasks involving sensor data that our approach allows examples to exit earlier with better detection rate and without executing all the layers in a deep model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes