SDAIASAug 4, 2025

CoughViT: A Self-Supervised Vision Transformer for Cough Audio Representation Learning

arXiv:2508.03764v11 citationsh-index: 9SemWeb
Originality Incremental advance
AI Analysis

This work addresses the challenge of limited data for diagnosing respiratory conditions beyond COVID-19, representing an incremental improvement in AI-based healthcare diagnostics.

The paper tackled the problem of label and data scarcity in AI-based diagnostic systems for respiratory diseases by proposing CoughViT, a self-supervised pre-training framework for cough audio representation learning, which matched or exceeded state-of-the-art supervised audio representations in enhancing performance on downstream cough classification tasks.

Physicians routinely assess respiratory sounds during the diagnostic process, providing insight into the condition of a patient's airways. In recent years, AI-based diagnostic systems operating on respiratory sounds, have demonstrated success in respiratory disease detection. These systems represent a crucial advancement in early and accessible diagnosis which is essential for timely treatment. However, label and data scarcity remain key challenges, especially for conditions beyond COVID-19, limiting diagnostic performance and reliable evaluation. In this paper, we propose CoughViT, a novel pre-training framework for learning general-purpose cough sound representations, to enhance diagnostic performance in tasks with limited data. To address label scarcity, we employ masked data modelling to train a feature encoder in a self-supervised learning manner. We evaluate our approach against other pre-training strategies on three diagnostically important cough classification tasks. Experimental results show that our representations match or exceed current state-of-the-art supervised audio representations in enhancing performance on downstream tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes