LGAIJul 15, 2024

On-Device Training of Fully Quantized Deep Neural Networks on Cortex-M Microcontrollers

arXiv:2407.10734v214 citationsh-index: 18
Originality Incremental advance
AI Analysis

This enables adaptive DNNs on low-power microcontrollers for edge applications, but it is incremental as it builds on existing quantization and training techniques.

The paper tackled on-device training of deep neural networks on resource-constrained Cortex-M microcontrollers by developing a method using fully quantized training and dynamic partial gradient updates, demonstrating feasibility on vision and time-series datasets with tradeoffs in accuracy, memory, energy, and latency.

On-device training of DNNs allows models to adapt and fine-tune to newly collected data or changing domains while deployed on microcontroller units (MCUs). However, DNN training is a resource-intensive task, making the implementation and execution of DNN training algorithms on MCUs challenging due to low processor speeds, constrained throughput, limited floating-point support, and memory constraints. In this work, we explore on-device training of DNNs for Cortex-M MCUs. We present a method that enables efficient training of DNNs completely in place on the MCU using fully quantized training (FQT) and dynamic partial gradient updates. We demonstrate the feasibility of our approach on multiple vision and time-series datasets and provide insights into the tradeoff between training accuracy, memory overhead, energy, and latency on real hardware.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes