CLAILGMay 11, 2021

Including Signed Languages in Natural Language Processing

arXiv:2105.05222v2727 citations
Originality Synthesis-oriented
AI Analysis

It addresses the problem of excluding signed languages from NLP for deaf and hard of hearing communities, but is incremental as it proposes directions rather than presenting new results.

This position paper calls for including signed languages in NLP research to address the lack of linguistic modeling in existing Sign Language Processing, highlighting the need for tokenization, linguistically-informed models, data collection, and community involvement.

Signed languages are the primary means of communication for many deaf and hard of hearing individuals. Since signed languages exhibit all the fundamental linguistic properties of natural language, we believe that tools and theories of Natural Language Processing (NLP) are crucial towards its modeling. However, existing research in Sign Language Processing (SLP) seldom attempt to explore and leverage the linguistic organization of signed languages. This position paper calls on the NLP community to include signed languages as a research area with high social and scientific impact. We first discuss the linguistic properties of signed languages to consider during their modeling. Then, we review the limitations of current SLP models and identify the open challenges to extend NLP to signed languages. Finally, we urge (1) the adoption of an efficient tokenization method; (2) the development of linguistically-informed models; (3) the collection of real-world signed language data; (4) the inclusion of local signed language communities as an active and leading voice in the direction of research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes