IndoNLP 2025: Shared Task on Real-Time Reverse Transliteration for Romanized Indo-Aryan languages
This work addresses the typing experience for users of low-resource Indo-Aryan languages by improving transliteration accuracy, though it is incremental as it builds on existing transliteration methods.
The paper tackled the problem of inaccurate real-time reverse transliteration for Romanized Indo-Aryan languages by introducing a shared task that evaluated models for Sinhala, Hindi, and Malayalam, with four teams participating in the final evaluation.
The paper overviews the shared task on Real-Time Reverse Transliteration for Romanized Indo-Aryan languages. It focuses on the reverse transliteration of low-resourced languages in the Indo-Aryan family to their native scripts. Typing Romanized Indo-Aryan languages using ad-hoc transliterals and achieving accurate native scripts are complex and often inaccurate processes with the current keyboard systems. This task aims to introduce and evaluate a real-time reverse transliterator that converts Romanized Indo-Aryan languages to their native scripts, improving the typing experience for users. Out of 11 registered teams, four teams participated in the final evaluation phase with transliteration models for Sinhala, Hindi and Malayalam. These proposed solutions not only solve the issue of ad-hoc transliteration but also empower low-resource language usability in the digital arena.