LGJun 14, 2022

Using Machine Learning to Augment Dynamic Time Warping Based Signal Classification

arXiv:2206.07200v13 citationsh-index: 1
Originality Incremental advance
AI Analysis

This addresses the speed-accuracy trade-off in signal processing for applications like health monitoring, though it is incremental as it builds on FastDTW with domain-specific learning.

The paper tackled the computational expense of Dynamic Time Warping (DTW) for signal classification by introducing MLDTW, which uses machine learning to learn domain-specific warps and limit the search space, resulting in at least comparable speed to FastDTW and a 60% average error reduction across four datasets.

Modern applications such as voice recognition rely on the ability to compare signals to pre-recorded ones to classify them. However, this comparison typically needs to ignore differences due to signal noise, temporal offset, signal magnitude, and other external factors. The Dynamic Time Warping (DTW) algorithm quantifies this similarity by finding corresponding regions between the signals and non-linearly warping one signal by stretching and shrinking it. Unfortunately, searching through all "warps" of a signal to find the best corresponding regions is computationally expensive. The FastDTW algorithm improves performance, but sacrifices accuracy by only considering small signal warps. My goal is to improve the speed of DTW while maintaining high accuracy. My key insight is that in any particular application domain, signals exhibit specific types of variation. For example, the accelerometer signal measured for two different people would differ based on their stride length and weight. My system, called Machine Learning DTW (MLDTW), uses machine learning to learn the types of warps that are common in a particular domain. It then uses the learned model to improve DTW performance by limiting the search of potential warps appropriately. My results show that compared to FastDTW, MLDTW is at least as fast and reduces errors by 60% on average across four different data sets. These improvements will significantly impact a wide variety of applications (e.g. health monitoring) and enable more scalable processing of multivariate, higher frequency, and longer signal recordings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes