CLAINov 30, 2025

Accelerating Bangla NLP Tasks with Automatic Mixed Precision: Resource-Efficient Training Preserving Model Efficacy

arXiv:2512.00829v1h-index: 1
Originality Synthesis-oriented
AI Analysis

This work addresses computational barriers for NLP development in Bangla, especially in hardware-constrained settings, but it is incremental as it applies an existing method (AMP) to new data.

The paper tackled the problem of high computational resource requirements for training NLP models in Bangla by applying automatic mixed precision (AMP) training, resulting in a 44.5% faster training speed and 17.6% lower memory usage while maintaining F-1 scores within 99.7% of full-precision baselines.

Training models for Natural Language Processing (NLP) requires substantial computational resources and time, posing significant challenges, especially for NLP development in Bangla, where access to high-end hardware is often limited. In this work, we explore automatic mixed precision (AMP) training as a means to improve computational efficiency without sacrificing model performance. By leveraging a dynamic mix of 16-bit and 32-bit floating-point computations, AMP lowers GPU memory requirements and speeds up training without degrading model performance. We evaluate AMP across four standard Bangla NLP tasks, namely sentiment analysis, named entity recognition, error classification, and question answering, using four transformer-based models: BanglaBERT, BanglishBERT, XLM-R, and mBERT. Our results demonstrate that AMP accelerates training by 44.5% and reduces memory consumption by 17.6%, while maintaining F-1 score within 99.7% of the full-precision baselines. This empirical study highlights AMP's potential to democratize access to state-of-the-art NLP capabilities in hardware-constrained settings by lowering computational barriers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes