CLAILGMar 13, 2025

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

arXiv:2503.13505v213 citationsh-index: 8
Originality Synthesis-oriented
AI Analysis

This is an incremental survey that aids researchers and practitioners in selecting and extending ensemble strategies for LLMs in real-world applications.

The paper surveys ensemble learning techniques for large language models (LLMs) to address issues like inconsistent outputs and biases in text and code generation, categorizing methods and highlighting benefits such as improved diversity and output quality.

Generative Pretrained Transformers (GPTs) are foundational Large Language Models (LLMs) for text generation. However, individual LLMs often produce inconsistent outputs and exhibit biases, limiting their representation of diverse language patterns. The closed-source nature of many powerful LLMs further restricts industry applications due to data privacy concerns. Inspired by successes in text generation, LLM ensemble techniques are now increasingly explored for code generation. This article reviews these emerging ensemble approaches to enhance understanding, encourage further research, and promote practical implementation in both text and code generation. We categorize LLM ensembles into seven main methods - weight merging, knowledge fusion, mixture-of-experts, reward ensemble, output ensemble, routing, and cascading - analyzing capabilities of those approaches. Our findings highlight key benefits such as improved diversity representation, enhanced output quality, and greater application flexibility. These insights aid model selection for real-world tasks and crucially, lay groundwork for extending ensemble strategies to multimodal LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes