Robust Active Learning for Electrocardiographic Signal Classification
This work addresses the healthcare industry's need for efficient ECG signal classification by reducing labeling costs and handling data issues, though it is incremental as it builds on existing active learning techniques.
The paper tackled the problem of classifying electrocardiographic (ECG) signals by addressing challenges like data imbalance and noisy labels, proposing a robust active learning method that selects informative instances via clustering and reduces label noise, with experiments on the MIT-BIH arrhythmia database showing effectiveness.
The classification of electrocardiographic (ECG) signals is a challenging problem for healthcare industry. Traditional supervised learning methods require a large number of labeled data which is usually expensive and difficult to obtain for ECG signals. Active learning is well-suited for ECG signal classification as it aims at selecting the best set of labeled data in order to maximize the classification performance. Motivated by the fact that ECG data are usually heavily unbalanced among different classes and the class labels are noisy as they are manually labeled, this paper proposes a novel solution based on robust active learning for addressing these challenges. The key idea is to first apply the clustering of the data in a low dimensional embedded space and then select the most information instances within local clusters. By selecting the most informative instances relying on local average minimal distances, the algorithm tends to select the data for labelling in a more diversified way. Finally, the robustness of the model is further enhanced by adding a novel noisy label reduction scheme after the selection of the labeled data. Experiments on the ECG signal classification from the MIT-BIH arrhythmia database demonstrate the effectiveness of the proposed algorithm.