SD | AI | AS | May 19, 2025

The Computation of Generalized Embeddings for Underwater Acoustic Target Recognition using Contrastive Learning

arXiv:2505.12904v1 | 4 citations | h-index: 25 | Applied Acoustics
Originality: Incremental advance
AI Analysis

This paper addresses the shortage of labeled data for ocean noise monitoring and offers a potential route to automatic underwater acoustic analysis. The contribution is incremental, since it adapts existing unsupervised techniques to a specific domain.

The paper tackles underwater acoustic target recognition with an unsupervised contrastive learning method: a Conformer-based encoder trained with the VICReg loss on unlabeled data. The resulting embeddings are reported to be robust and generalized when used to classify ship types and marine mammal vocalizations.

The increasing level of sound pollution in marine environments poses a growing threat to ocean health, making it crucial to monitor underwater noise. By monitoring this noise, the sources responsible for this pollution can be mapped. Monitoring is performed by passively listening to these sounds, which generates a large number of data records capturing a mix of sound sources such as ship activities and marine mammal vocalizations. Although machine learning offers a promising solution for automatic sound classification, current state-of-the-art methods rely on supervised learning, which requires a large amount of high-quality labeled data that is not publicly available. In contrast, a massive amount of lower-quality unlabeled data is publicly available, offering the opportunity to explore unsupervised learning techniques. This research explores this possibility by implementing an unsupervised Contrastive Learning approach. Here, a Conformer-based encoder is optimized with the Variance-Invariance-Covariance Regularization (VICReg) loss function on these lower-quality unlabeled data, and the learned representations are then transferred to the labeled data. Through classification tasks involving the recognition of ship types and marine mammal vocalizations, our method is shown to produce robust and generalized embeddings. This shows the potential of unsupervised methods for various automatic underwater acoustic analysis tasks.
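
To make the loss named in the abstract concrete, here is a minimal NumPy sketch of a Variance-Invariance-Covariance Regularization objective for two batches of embeddings, for example the encoder outputs for two augmented views of the same audio clips. It follows the general form of the VICReg loss, not necessarily this paper's implementation. The function name `vicreg_loss`, the weights 25/25/1 (the defaults from the original VICReg work), and the `eps` value are assumptions and are not taken from this paper. Normalization conventions also vary between implementations.

```python
import numpy as np


def vicreg_loss(z_a, z_b, inv_weight=25.0, var_weight=25.0, cov_weight=1.0, eps=1e-4):
    """Illustrative VICReg-style loss for two (batch, dim) embedding arrays.

    The weights and eps are assumed defaults, not values from the paper.
    """
    z_a = np.asarray(z_a, dtype=float)
    z_b = np.asarray(z_b, dtype=float)
    n, d = z_a.shape

    # Invariance: mean squared Euclidean distance between paired embeddings.
    invariance = np.mean(np.sum((z_a - z_b) ** 2, axis=1))

    def variance_term(z):
        # Hinge on each dimension's standard deviation, pushing it towards >= 1
        # so the encoder cannot collapse to a constant output.
        std = np.sqrt(z.var(axis=0, ddof=1) + eps)
        return np.mean(np.maximum(0.0, 1.0 - std))

    def covariance_term(z):
        # Penalize squared off-diagonal covariance entries to decorrelate dimensions.
        centered = z - z.mean(axis=0)
        cov = centered.T @ centered / (n - 1)
        off_diag = cov - np.diag(np.diag(cov))
        return np.sum(off_diag ** 2) / d

    variance = variance_term(z_a) + variance_term(z_b)
    covariance = covariance_term(z_a) + covariance_term(z_b)
    total = inv_weight * invariance + var_weight * variance + cov_weight * covariance
    return total, {"invariance": invariance, "variance": variance, "covariance": covariance}
```

In a training loop, `z_a` and `z_b` would come from passing two augmentations of each unlabeled recording through the encoder, and an automatic-differentiation framework would replace NumPy. The variance term is what separates this objective from a plain distance loss: identical, constant embeddings give zero invariance cost but a large variance penalty.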

Code Implementations: 1 repo