CL LGFeb 12, 2024

Label-Efficient Model Selection for Text Generation

Shir Ashury-Tahan, Ariel Gera, Benjamin Sznajder, Leshem Choshen, Liat Ein-Dor, Eyal Shnarch

arXiv:2402.07891v317.133 citationsh-index: 18ACL

Originality Highly original

AI Analysis

This addresses the resource-intensive process of model selection for text generation tasks, offering a practical solution for researchers and practitioners.

The paper tackles the problem of costly model selection in text generation by introducing DiffUse, a method that reduces the required preference annotations by up to 75% while maintaining high evaluation reliability.

Model selection for a given target task can be costly, as it may entail extensive annotation of the quality of outputs of different models. We introduce DiffUse, an efficient method to make an informed decision between candidate text generation models based on preference annotations. DiffUse reduces the required amount of annotations, thus saving valuable time and resources in performing evaluation. DiffUse intelligently selects instances by clustering embeddings that represent the semantic differences between model outputs. Thus, it is able to identify a subset of examples that are more informative for preference decisions. Our method is model-agnostic, and can be applied to any text generation model for selecting between models, prompts and configurations. Moreover, we propose a practical iterative approach for dynamically determining how many instances to annotate. In a series of experiments over hundreds of model pairs, we demonstrate that DiffUse can dramatically reduce the required number of annotations -- by up to 75% -- while maintaining high evaluation reliability.

View on arXiv PDF

Similar