SDApr 30

Accent Conversion: A Problem-Driven Survey of Sociolinguistic and Technical Constraints

arXiv:2604.2728147.9
Predicted impact top 60% in SD · last 90 daysOriginality Synthesis-oriented
AI Analysis

For researchers in speech processing and sociolinguistics, this survey provides a structured overview of technical and linguistic constraints in accent conversion, but it is an incremental review rather than introducing new results.

This survey reviews the evolution of accent conversion methods, from rule-based signal processing to neural architectures, addressing challenges in data alignment, representation disentanglement, and resource scarcity. It identifies persistent challenges and outlines future research directions for controllable and perceptually consistent conversion.

Accent conversion has rapidly progressed alongside growing interest in improving global cross-cultural communication. This survey presents an overview of the evolution of accent conversion methodologies, analyzing how the field has developed in response to fundamental challenges related to data alignment, representation disentanglement, and resource scarcity. We trace the progression from early rule-based digital signal processing approaches such as spectral manipulation and formant-based analysis to modern neural architectures capable of flexible and reference-free accent transformation. In addition, the survey situates accent conversion within its linguistic foundations and examines how different application requirements impose varying constraints on the balance between accent modification and speaker identity preservation. Finally, it reviews commonly used speech datasets and evaluation methodologies, identifies persistent challenges, and outlines directions for future research aimed at achieving more controllable and perceptually consistent accent conversion.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes