SPAIJul 27, 2025

A Multi-Stage Hybrid CNN-Transformer Network for Automated Pediatric Lung Sound Classification

arXiv:2507.20408v24 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses pediatric respiratory disease diagnosis, particularly for children under 6 years in resource-limited settings, by providing a scalable solution, though it is incremental as it builds on existing methods.

The paper tackled automated pediatric lung sound classification by proposing a multi-stage hybrid CNN-Transformer network, achieving scores of 0.9039 in binary and 0.8448 in multiclass event classification, with improvements of 3.81% and 5.94% over previous models.

Automated analysis of lung sound auscultation is essential for monitoring respiratory health, especially in regions facing a shortage of skilled healthcare workers. While respiratory sound classification has been widely studied in adults, its ap plication in pediatric populations, particularly in children aged <6 years, remains an underexplored area. The developmental changes in pediatric lungs considerably alter the acoustic proper ties of respiratory sounds, necessitating specialized classification approaches tailored to this age group. To address this, we propose a multistage hybrid CNN-Transformer framework that combines CNN-extracted features with an attention-based architecture to classify pediatric respiratory diseases using scalogram images from both full recordings and individual breath events. Our model achieved an overall score of 0.9039 in binary event classifi cation and 0.8448 in multiclass event classification by employing class-wise focal loss to address data imbalance. At the recording level, the model attained scores of 0.720 for ternary and 0.571 for multiclass classification. These scores outperform the previous best models by 3.81% and 5.94%, respectively. This approach offers a promising solution for scalable pediatric respiratory disease diagnosis, especially in resource-limited settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes