SDCLLGASDec 17, 2023

A review-based study on different Text-to-Speech technologies

arXiv:2312.11563v18 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

It provides insights for researchers, developers, and users to understand TTS technologies for specific applications, but is incremental as it is a review-based study.

This paper reviews various Text-to-Speech technologies, comparing their advantages and limitations in terms of naturalness, complexity, and application suitability, and explores recent advancements like neural and hybrid TTS.

This research paper presents a comprehensive review-based study on various Text-to-Speech (TTS) technologies. TTS technology is an important aspect of human-computer interaction, enabling machines to convert written text into audible speech. The paper examines the different TTS technologies available, including concatenative TTS, formant synthesis TTS, and statistical parametric TTS. The study focuses on comparing the advantages and limitations of these technologies in terms of their naturalness of voice, the level of complexity of the system, and their suitability for different applications. In addition, the paper explores the latest advancements in TTS technology, including neural TTS and hybrid TTS. The findings of this research will provide valuable insights for researchers, developers, and users who want to understand the different TTS technologies and their suitability for specific applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes