LG AISep 11, 2024

A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption

Marcus Rüb, Philipp Tuchel, Axel Sikora, Daniel Mueller-Gritschneder

arXiv:2409.07114v111.512 citationsh-index: 4

Originality Incremental advance

AI Analysis

This addresses the problem of catastrophic forgetting and memory inefficiency for developers deploying machine learning on low-performance embedded devices like microcontrollers, though it appears incremental as it builds on existing knowledge distillation and model adaptation techniques.

The paper tackles incremental learning for TinyML on resource-constrained devices by proposing an algorithm that uses dataset distillation and dynamic model size adaptation, achieving a negligible 1% accuracy loss while using only 43% of FLOPs and 1% of the original dataset memory.

A new algorithm for incremental learning in the context of Tiny Machine learning (TinyML) is presented, which is optimized for low-performance and energy efficient embedded devices. TinyML is an emerging field that deploys machine learning models on resource-constrained devices such as microcontrollers, enabling intelligent applications like voice recognition, anomaly detection, predictive maintenance, and sensor data processing in environments where traditional machine learning models are not feasible. The algorithm solve the challenge of catastrophic forgetting through the use of knowledge distillation to create a small, distilled dataset. The novelty of the method is that the size of the model can be adjusted dynamically, so that the complexity of the model can be adapted to the requirements of the task. This offers a solution for incremental learning in resource-constrained environments, where both model size and computational efficiency are critical factors. Results show that the proposed algorithm offers a promising approach for TinyML incremental learning on embedded devices. The algorithm was tested on five datasets including: CIFAR10, MNIST, CORE50, HAR, Speech Commands. The findings indicated that, despite using only 43% of Floating Point Operations (FLOPs) compared to a larger fixed model, the algorithm experienced a negligible accuracy loss of just 1%. In addition, the presented method is memory efficient. While state-of-the-art incremental learning is usually very memory intensive, the method requires only 1% of the original data set.

View on arXiv PDF

Similar