Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control
This addresses the challenge of unnatural phrasing in lyrics generation for songwriters and musicians, though it appears incremental as it builds on existing text-to-lyrics methods with enhanced control.
The paper tackles the problem of generating full-song lyrics with precise syllable control across multiple granularities and adherence to song form structures, resulting in a framework that produces complete lyrics aligned with specified constraints.
Lyrics generation presents unique challenges, particularly in achieving precise syllable control while adhering to song form structures such as verses and choruses. Conventional line-by-line approaches often lead to unnatural phrasing, underscoring the need for more granular syllable management. We propose a framework for lyrics generation that enables multi-level syllable control at the word, phrase, line, and paragraph levels, aware of song form. Our approach generates complete lyrics conditioned on input text and song form, ensuring alignment with specified syllable constraints. Generated lyrics samples are available at: https://tinyurl.com/lyrics9999