LGCVSDASMLJun 1, 2019

Super-resolution of Time-series Labels for Bootstrapped Event Detection

arXiv:1906.00254v11 citations
Originality Incremental advance
AI Analysis

This addresses data scarcity in real-world scenarios like crowd-sourcing, where only small amounts of expert-labeled data are available, by incrementally improving label quality for event detection.

The paper tackles the problem of limited expert-labeled time-series data by developing a framework that super-resolves abundant low-quality labels using a Kernel Density Estimator (KDE), enabling a Convolutional Neural Network (CNN) to achieve a 22.1% improvement in F1 score over the next best baseline.

Solving real-world problems, particularly with deep learning, relies on the availability of abundant, quality data. In this paper we develop a novel framework that maximises the utility of time-series datasets that contain only small quantities of expertly-labelled data, larger quantities of weakly (or coarsely) labelled data and a large volume of unlabelled data. This represents scenarios commonly encountered in the real world, such as in crowd-sourcing applications. In our work, we use a nested loop using a Kernel Density Estimator (KDE) to super-resolve the abundant low-quality data labels, thereby enabling effective training of a Convolutional Neural Network (CNN). We demonstrate two key results: a) The KDE is able to super-resolve labels more accurately, and with better calibrated probabilities, than well-established classifiers acting as baselines; b) Our CNN, trained on super-resolved labels from the KDE, achieves an improvement in F1 score of 22.1% over the next best baseline system in our candidate problem domain.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes