CLOct 11, 2022

Machine Translation between Spoken Languages and Signed Languages Represented in SignWriting

arXiv:2210.05404v2276 citationsh-index: 19
AI Analysis

This work addresses the lack of support for signed languages in machine translation systems, enabling their inclusion in natural language processing research.

The paper tackles machine translation between spoken and signed languages using SignWriting as an intermediate representation, achieving over 30 BLEU in bilingual translation and over 20 BLEU in multilingual setups.

This paper presents work on novel machine translation (MT) systems between spoken and signed languages, where signed languages are represented in SignWriting, a sign language writing system. Our work seeks to address the lack of out-of-the-box support for signed languages in current MT systems and is based on the SignBank dataset, which contains pairs of spoken language text and SignWriting content. We introduce novel methods to parse, factorize, decode, and evaluate SignWriting, leveraging ideas from neural factored MT. In a bilingual setup--translating from American Sign Language to (American) English--our method achieves over 30 BLEU, while in two multilingual setups--translating in both directions between spoken languages and signed languages--we achieve over 20 BLEU. We find that common MT techniques used to improve spoken language translation similarly affect the performance of sign language translation. These findings validate our use of an intermediate text representation for signed languages to include them in natural language processing research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes