LG CLJun 17, 2024

$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts

Guanjie Chen, Xinyu Zhao, Tianlong Chen, Yu Cheng

arXiv:2406.11353v111.59 citationsh-index: 31Has Code

Originality Incremental advance

AI Analysis

This work addresses the reliability gap for researchers and practitioners using MoE models, providing insights for safer and more stable adaptations, though it is incremental as it focuses on assessment rather than introducing new methods.

The paper tackles the reliability assessment of Sparse Mixture-of-Experts (SMoE) models in large language models, finding that with proper hyperparameters, training, and inference techniques, MoE models can be built more reliably than dense LLMs, as shown through comprehensive testing across safety, adversarial resilience, and out-of-distribution robustness.

Mixture-of-Experts (MoE) has gained increasing popularity as a promising framework for scaling up large language models (LLMs). However, the reliability assessment of MoE lags behind its surging applications. Moreover, when transferred to new domains such as in fine-tuning MoE models sometimes underperform their dense counterparts. Motivated by the research gap and counter-intuitive phenomenon, we propose $\texttt{MoE-RBench}$, the first comprehensive assessment of SMoE reliability from three aspects: $\textit{(i)}$ safety and hallucination, $\textit{(ii)}$ resilience to adversarial attacks, and $\textit{(iii)}$ out-of-distribution robustness. Extensive models and datasets are tested to compare the MoE to dense networks from these reliability dimensions. Our empirical observations suggest that with appropriate hyperparameters, training recipes, and inference techniques, we can build the MoE model more reliably than the dense LLM. In particular, we find that the robustness of SMoE is sensitive to the basic training settings. We hope that this study can provide deeper insights into how to adapt the pre-trained MoE model to other tasks with higher-generation security, quality, and stability. Codes are available at https://github.com/UNITES-Lab/MoE-RBench

View on arXiv PDF Code

Similar