AS | CL | SD | Dec 23, 2018

Pansori: ASR Corpus Generation from Open Online Video Contents

arXiv:1812.09798v1 | 6 citations | Has Code
Originality: Synthesis-oriented
AI Analysis

Pansori provides a valuable resource for researchers and developers working on Korean ASR, though the contribution is incremental: it applies existing methods to new data.

The authors address the lack of freely available, high-quality Korean speech recognition datasets by introducing Pansori, a program that semi-automatically generates ASR corpora from online videos. Using it, they created the Pansori-TEDxKR dataset, which the paper describes as the first such corpus for Korean.

This paper introduces Pansori, a program used to create ASR (automatic speech recognition) corpora from online video content. It utilizes a cloud-based speech API to easily create a corpus in different languages. Using this program, we semi-automatically generated the Pansori-TEDxKR dataset from Korean TED conference talks with community-transcribed subtitles. It is the first high-quality corpus for the Korean language freely available for independent research. Pansori is released as open-source software, and the generated corpus is released under a permissive public license for community use and participation.
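
The abstract does not say how the cloud speech API is used. One plausible reading is that each subtitle-aligned audio segment is transcribed by the API and kept only if the result agrees closely with the community subtitle. The sketch below illustrates that idea only; it is not Pansori's actual code. The `Segment` type, the `recognize` callback (a stand-in for whatever cloud API call is made), the `similarity` metric, and the 0.8 threshold are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Callable, List


@dataclass
class Segment:
    """One subtitle cue: hypothetical container, not a Pansori type."""
    start: float  # cue start time in seconds
    end: float    # cue end time in seconds
    text: str     # community-transcribed subtitle text


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1], ignoring case and whitespace."""
    norm_a = "".join(a.lower().split())
    norm_b = "".join(b.lower().split())
    return SequenceMatcher(None, norm_a, norm_b).ratio()


def filter_segments(
    segments: List[Segment],
    recognize: Callable[[Segment], str],
    threshold: float = 0.8,  # assumed cutoff, not taken from the paper
) -> List[Segment]:
    """Keep segments whose subtitle text agrees with the ASR hypothesis.

    `recognize` is supplied by the caller and would wrap a speech API
    request for the audio between `start` and `end`.
    """
    kept = []
    for seg in segments:
        hypothesis = recognize(seg)
        if similarity(seg.text, hypothesis) >= threshold:
            kept.append(seg)
    return kept
```

Passing the recognizer as a callback keeps the filter independent of any particular speech service, so it can be exercised with a stub before any audio is sent to a real API.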
