SD AI LG ASNov 22, 2024

Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network

arXiv:2411.15082v11 citationsh-index: 1Has Code

Originality Synthesis-oriented

AI Analysis

This addresses speaker identification for security and personal assistant applications, but it is incremental as it builds on existing methods with minor adaptations.

The paper tackles speaker identification with minimal datasets and constrained resources by proposing a lightweight 1D-Convolutional Neural Network, achieving a validation accuracy of 97.87%.

Voice recognition and speaker identification are vital for applications in security and personal assistants. This paper presents a lightweight 1D-Convolutional Neural Network (1D-CNN) designed to perform speaker identification on minimal datasets. Our approach achieves a validation accuracy of 97.87%, leveraging data augmentation techniques to handle background noise and limited training samples. Future improvements include testing on larger datasets and integrating transfer learning methods to enhance generalizability. We provide all code, the custom dataset, and the trained models to facilitate reproducibility. These resources are available on our GitHub repository: https://github.com/IrfanNafiz/RecMe.

View on arXiv PDF Code

Similar