NECVLGJul 31, 2022

enpheeph: A Fault Injection Framework for Spiking and Compressed Deep Neural Networks

arXiv:2208.00328v117 citationsh-index: 20Has Code
Originality Incremental advance
AI Analysis

This work addresses reliability analysis for safety-critical applications like autonomous driving, providing a tool for fault injection in emerging DNN models, but it is incremental as it builds on existing resiliency analysis methods.

The paper tackles the reliability issues in deploying spiking and compressed deep neural networks by introducing enpheeph, a fault injection framework that enables optimized execution on hardware with minimal code modifications, showing that DNNs can experience over 40% accuracy drop at a fault rate of 7 x 10^(-7) per parameter and achieving at least 10x lower run-time overhead than state-of-the-art frameworks.

Research on Deep Neural Networks (DNNs) has focused on improving performance and accuracy for real-world deployments, leading to new models, such as Spiking Neural Networks (SNNs), and optimization techniques, e.g., quantization and pruning for compressed networks. However, the deployment of these innovative models and optimization techniques introduces possible reliability issues, which is a pillar for DNNs to be widely used in safety-critical applications, e.g., autonomous driving. Moreover, scaling technology nodes have the associated risk of multiple faults happening at the same time, a possibility not addressed in state-of-the-art resiliency analyses. Towards better reliability analysis for DNNs, we present enpheeph, a Fault Injection Framework for Spiking and Compressed DNNs. The enpheeph framework enables optimized execution on specialized hardware devices, e.g., GPUs, while providing complete customizability to investigate different fault models, emulating various reliability constraints and use-cases. Hence, the faults can be executed on SNNs as well as compressed networks with minimal-to-none modifications to the underlying code, a feat that is not achievable by other state-of-the-art tools. To evaluate our enpheeph framework, we analyze the resiliency of different DNN and SNN models, with different compression techniques. By injecting a random and increasing number of faults, we show that DNNs can show a reduction in accuracy with a fault rate as low as 7 x 10 ^ (-7) faults per parameter, with an accuracy drop higher than 40%. Run-time overhead when executing enpheeph is less than 20% of the baseline execution time when executing 100 000 faults concurrently, at least 10x lower than state-of-the-art frameworks, making enpheeph future-proof for complex fault injection scenarios. We release enpheeph at https://github.com/Alexei95/enpheeph.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes