CLNov 16, 2023

Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability

arXiv:2311.09696v232 citationsh-index: 19
Originality Synthesis-oriented
AI Analysis

This work highlights limitations in large language models for serving diverse linguistic communities, with incremental benchmarking.

The study investigated ChatGPT's ability to identify languages, finding that it performs poorly compared to specialized tools, especially on African languages, using a benchmark of 670 languages.

ChatGPT has recently emerged as a powerful NLP tool that can carry out a variety of tasks. However, the range of languages ChatGPT can handle remains largely a mystery. To uncover which languages ChatGPT `knows', we investigate its language identification (LID) abilities. For this purpose, we compile Babel-670, a benchmark comprising 670 languages representing 24 language families spoken in five continents. Languages in Babel-670 run the gamut from the very high-resource to the very low-resource. We then study ChatGPT's (both GPT-3.5 and GPT-4) ability to (i) identify language names and language codes (ii) under zero- and few-shot conditions (iii) with and without provision of a label set. When compared to smaller finetuned LID tools, we find that ChatGPT lags behind. For example, it has poor performance on African languages. We conclude that current large language models would benefit from further development before they can sufficiently serve diverse communities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes