SE CLFeb 26, 2025

IndicEval-XL: Bridging Linguistic Diversity in Code Generation Across Indic Languages

Ujjwal Singh, Aditi Sharma, Nikhil Gupta, Deepakshi, Vivek Kumar Jha

arXiv:2502.19067v13.4h-index: 4Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of English-centric evaluation for developers in Indic-speaking regions, though it is incremental as it extends existing benchmarking approaches to new languages.

The authors tackled the lack of multilingual benchmarks for code generation by creating IndicEval-XL, a comprehensive benchmark that includes 6 Indic languages and 12 programming languages, making it publicly available for research.

Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation from natural language prompts, revolutionizing software development workflows. As we advance towards agent-based development paradigms, these models form the cornerstone of next-generation software development lifecycles. However, current benchmarks for evaluating multilingual code generation capabilities are predominantly English-centric, limiting their applicability across the global developer community. To address this limitation, we present IndicEval-XL, a comprehensive benchmark for code generation that incorporates 6 major Indic languages, collectively spoken by approximately 14\% of the world's population. Our benchmark bridges these languages with 12 programming languages, creating a robust evaluation framework. This work is particularly significant given India's representation of one-eighth of the global population and the crucial role Indic languages play in Indian society. IndicEval-XL represents a significant step toward expanding the linguistic diversity in code generation systems and evaluation frameworks. By developing resources that support multiple languages, we aim to make AI-powered development tools more inclusive and accessible to developers of various linguistic backgrounds. To facilitate further research and development in this direction, we make our dataset and evaluation benchmark publicly available at https://github.com/telekom/IndicEval-XL

View on arXiv PDF Code

Similar