AS AIFeb 2

HuPER: A Human-Inspired Framework for Phonetic Perception

Chenxu Guo, Jiachen Lian, Yisi Liu, Baihe Huang, Shriyaa Narayanan, Cheol Jun Cho, Gopala Anumanchipalli

arXiv:2602.01634v12.32 citationsh-index: 25Has Code

Originality Highly original

AI Analysis

This work addresses phonetic perception for speech processing, offering a novel framework with broad applicability across languages.

The authors tackled phonetic perception by proposing HuPER, a human-inspired framework that models it as adaptive inference over acoustic-phonetic evidence and linguistic knowledge, achieving state-of-the-art phonetic error rates on five English benchmarks and strong zero-shot transfer to 95 unseen languages with only 100 hours of training data.

We propose HuPER, a human-inspired framework that models phonetic perception as adaptive inference over acoustic-phonetics evidence and linguistic knowledge. With only 100 hours of training data, HuPER achieves state-of-the-art phonetic error rates on five English benchmarks and strong zero-shot transfer to 95 unseen languages. HuPER is also the first framework to enable adaptive, multi-path phonetic perception under diverse acoustic conditions. All training data, models, and code are open-sourced. Code and demo avaliable at https://github.com/HuPER29/HuPER.

View on arXiv PDF Code

Similar