CLSep 7, 2015

Enhancing Automatically Discovered Multi-level Acoustic Patterns Considering Context Consistency With Applications in Spoken Term Detection

Cheng-Tao Chung, Wei-Ning Hsu, Cheng-Yi Lee, Lin-Shan Lee

arXiv:1509.02217v12.25 citations

Originality Incremental advance

AI Analysis

This is an incremental improvement for spoken term detection systems, enhancing pattern discovery in speech processing.

The paper tackles the problem of improving automatically discovered acoustic patterns by relabeling pattern indices to ensure context consistency across different HMM sets, resulting in good improvements in spoken term detection experiments on TIMIT and Mandarin Broadcast News.

This paper presents a novel approach for enhancing the multiple sets of acoustic patterns automatically discovered from a given corpus. In a previous work it was proposed that different HMM configurations (number of states per model, number of distinct models) for the acoustic patterns form a two-dimensional space. Multiple sets of acoustic patterns automatically discovered with the HMM configurations properly located on different points over this two-dimensional space were shown to be complementary to one another, jointly capturing the characteristics of the given corpus. By representing the given corpus as sequences of acoustic patterns on different HMM sets, the pattern indices in these sequences can be relabeled considering the context consistency across the different sequences. Good improvements were observed in preliminary experiments of pattern spoken term detection (STD) performed on both TIMIT and Mandarin Broadcast News with such enhanced patterns.

View on arXiv PDF

Similar