Transcribing Educational Videos Using Whisper: A preliminary study on using AI for transcribing educational videos
This addresses the need for cost-effective and timely transcripts in e-learning, but it is incremental as it applies an existing method to new data without major innovations.
The study tackled the problem of generating transcripts for educational videos by evaluating Whisper's automatic speech recognition on 25 videos, identifying open research avenues for improving ASR in this domain.
Videos are increasingly being used for e-learning, and transcripts are vital to enhance the learning experience. The costs and delays of generating transcripts can be alleviated by automatic speech recognition (ASR) systems. In this article, we quantify the transcripts generated by whisper for 25 educational videos and identify some open avenues of research when leveraging ASR for transcribing educational videos.