CL AINov 30, 2025

Accelerating Bangla NLP Tasks with Automatic Mixed Precision: Resource-Efficient Training Preserving Model Efficacy

Md Mehrab Hossain Opi, Sumaiya Khan, Moshammad Farzana Rahman

arXiv:2512.00829v1h-index: 1

Originality Synthesis-oriented

AI Analysis

This work addresses computational barriers for NLP development in Bangla, especially in hardware-constrained settings, but it is incremental as it applies an existing method (AMP) to new data.

The paper tackled the problem of high computational resource requirements for training NLP models in Bangla by applying automatic mixed precision (AMP) training, resulting in a 44.5% faster training speed and 17.6% lower memory usage while maintaining F-1 scores within 99.7% of full-precision baselines.

Training models for Natural Language Processing (NLP) requires substantial computational resources and time, posing significant challenges, especially for NLP development in Bangla, where access to high-end hardware is often limited. In this work, we explore automatic mixed precision (AMP) training as a means to improve computational efficiency without sacrificing model performance. By leveraging a dynamic mix of 16-bit and 32-bit floating-point computations, AMP lowers GPU memory requirements and speeds up training without degrading model performance. We evaluate AMP across four standard Bangla NLP tasks, namely sentiment analysis, named entity recognition, error classification, and question answering, using four transformer-based models: BanglaBERT, BanglishBERT, XLM-R, and mBERT. Our results demonstrate that AMP accelerates training by 44.5% and reduces memory consumption by 17.6%, while maintaining F-1 score within 99.7% of the full-precision baselines. This empirical study highlights AMP's potential to democratize access to state-of-the-art NLP capabilities in hardware-constrained settings by lowering computational barriers.

View on arXiv PDF

Similar