Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability
This work addresses the high computational resource demands and environmental implications of LLMs for AI practitioners and sustainability efforts, though it appears incremental as it builds on existing pruning methods.
The research tackled the problem of reducing the environmental impact of large language models (LLMs) by developing a novel pruning approach based on systematic weight evaluation during training, resulting in moderate pruning that enhanced efficiency and reduced loss without compromising performance, while excessive pruning caused drastic deterioration.
The exponential growth of large language models (LLMs) like ChatGPT has revolutionized artificial intelligence, offering unprecedented capabilities in natural language processing. However, the extensive computational resources required for training these models have significant environmental implications, including high carbon emissions, energy consumption, and water usage. This research presents a novel approach to LLM pruning, focusing on the systematic evaluation of individual weight importance throughout the training process. By monitoring parameter evolution over time, we propose a method that effectively reduces model size without compromising performance. Extensive experiments with both a scaled-down LLM and a large multimodal model reveal that moderate pruning enhances efficiency and reduces loss, while excessive pruning drastically deteriorates model performance. These findings highlight the critical need for optimized AI models to ensure sustainable development, balancing technological advancement with environmental responsibility.