ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi
This toolkit provides an easy-to-use Python interface for developers to build online ASR systems, but it is incremental as it extends existing Kaldi-based tools.
The paper introduces ExKaldi-RT, a real-time automatic speech recognition toolkit built on Kaldi and Python, which achieved competitive performance in benchmark experiments on the LibriSpeech corpus.
This paper describes the ExKaldi-RT online automatic speech recognition (ASR) toolkit that is implemented based on the Kaldi ASR toolkit and Python language. ExKaldi-RT provides tools for building online recognition pipelines. While similar tools are available built on Kaldi, a key feature of ExKaldi-RT that it works on Python, which has an easy-to-use interface that allows online ASR system developers to develop original research, such as by applying neural network-based signal processing and by decoding model trained with deep learning frameworks. We performed benchmark experiments on the minimum LibriSpeech corpus, and it showed that ExKaldi-RT could achieve competitive ASR performance in real-time recognition.