Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages
This work addresses the lack of ASR systems for regional languages, which could benefit speakers and developers in Southeast Asia. However, it appears incremental: it focuses on resource collection and presents no new methods or results.
The paper tackles the problem of building automatic speech recognition systems for Southeast Asian languages such as Bahasa Indonesia and Thai. It addresses challenges including limited speech and text resources and a lack of linguistic knowledge, and it illustrates strategies for collecting the necessary resources.
This paper provides an overview of our Automatic Speech Recognition (ASR) systems for Southeast Asian languages. Because little prior work exists on these regional languages, several difficulties must be addressed before the systems can be built, including limited speech and text resources and a lack of linguistic knowledge. This work takes Bahasa Indonesia and Thai as examples to illustrate strategies for collecting the various resources required to build ASR systems.