ASLGSDMLMay 12, 2020

Infant Crying Detection in Real-World Environments

arXiv:2005.07036v645 citations
AI Analysis

This work addresses the challenge of reliable infant cry detection for caregivers and health monitoring in everyday settings, though it is incremental as it builds on established machine learning approaches.

The paper tackled the problem of infant cry detection in noisy real-world environments, finding that a model using deep spectrum and acoustic features achieved an F1 score of 0.613, outperforming existing methods in external validity.

Most existing cry detection models have been tested with data collected in controlled settings. Thus, the extent to which they generalize to noisy and lived environments is unclear. In this paper, we evaluate several established machine learning approaches including a model leveraging both deep spectrum and acoustic features. This model was able to recognize crying events with F1 score 0.613 (Precision: 0.672, Recall: 0.552), showing improved external validity over existing methods at cry detection in everyday real-world settings. As part of our evaluation, we collect and annotate a novel dataset of infant crying compiled from over 780 hours of labeled real-world audio data, captured via recorders worn by infants in their homes, which we make publicly available. Our findings confirm that a cry detection model trained on in-lab data underperforms when presented with real-world data (in-lab test F1: 0.656, real-world test F1: 0.236), highlighting the value of our new dataset and model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes