CLSDASOct 8, 2025

Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis

arXiv:2510.07096v1h-index: 8
Originality Incremental advance
AI Analysis

This addresses the problem of generating nuanced sarcastic speech for applications in human-computer interaction, though it appears incremental as it builds on existing synthesis methods with new components.

The paper tackles the challenge of synthesizing sarcastic speech by proposing an LLM-enhanced retrieval-augmented framework that combines semantic embeddings from a fine-tuned LLaMA 3 with prosodic exemplars via RAG, integrated into a VITS backbone. Experiments show it outperforms baselines in objective measures and subjective evaluations, improving speech naturalness, sarcastic expressivity, and sarcasm detection.

Sarcasm is a subtle form of non-literal language that poses significant challenges for speech synthesis due to its reliance on nuanced semantic, contextual, and prosodic cues. While existing speech synthesis research has focused primarily on broad emotional categories, sarcasm remains largely unexplored. In this paper, we propose a Large Language Model (LLM)-enhanced Retrieval-Augmented framework for sarcasm-aware speech synthesis. Our approach combines (1) semantic embeddings from a LoRA-fine-tuned LLaMA 3, which capture pragmatic incongruity and discourse-level cues of sarcasm, and (2) prosodic exemplars retrieved via a Retrieval Augmented Generation (RAG) module, which provide expressive reference patterns of sarcastic delivery. Integrated within a VITS backbone, this dual conditioning enables more natural and contextually appropriate sarcastic speech. Experiments demonstrate that our method outperforms baselines in both objective measures and subjective evaluations, yielding improvements in speech naturalness, sarcastic expressivity, and downstream sarcasm detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes