ASSDOct 5, 2020

JSSS: free Japanese speech corpus for summarization and simplification

arXiv:2010.01793v13 citations
Originality Synthesis-oriented
AI Analysis

This provides a resource for researchers working on Japanese speech processing, but it is incremental as it focuses on creating a dataset rather than advancing methods.

The authors constructed JSSS, a free Japanese speech corpus for speech-based summarization and simplification, containing voices recorded for tasks like duration-constrained text-to-speech summarization and speaking-style simplification.

In this paper, we construct a new Japanese speech corpus for speech-based summarization and simplification, "JSSS" (pronounced "j-triple-s"). Given the success of reading-style speech synthesis from short-form sentences, we aim to design more difficult tasks for delivering information to humans. Our corpus contains voices recorded for two tasks that have a role in providing information under constraints: duration-constrained text-to-speech summarization and speaking-style simplification. It also contains utterances of long-form sentences as an optional task. This paper describes how we designed the corpus, which is available on our project page.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes