CLMay 31, 2023

Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers

arXiv:2305.19474v1234 citations
Originality Synthesis-oriented
AI Analysis

This work highlights ethical concerns for Indigenous communities in low-resource language translation, but it is incremental as it builds on existing ethical discussions without introducing new methods.

The paper addresses ethical issues in machine translation of Indigenous languages by surveying literature and conducting interviews with community leaders, teachers, and activists, finding that inclusion of native speakers is crucial for more ethical research.

In recent years machine translation has become very successful for high-resource language pairs. This has also sparked new interest in research on the automatic translation of low-resource languages, including Indigenous languages. However, the latter are deeply related to the ethnic and cultural groups that speak (or used to speak) them. The data collection, modeling and deploying machine translation systems thus result in new ethical questions that must be addressed. Motivated by this, we first survey the existing literature on ethical considerations for the documentation, translation, and general natural language processing for Indigenous languages. Afterward, we conduct and analyze an interview study to shed light on the positions of community leaders, teachers, and language activists regarding ethical concerns for the automatic translation of their languages. Our results show that the inclusion, at different degrees, of native speakers and community members is vital to performing better and more ethical research on Indigenous languages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes