CLAug 8, 2024

Open-domain Implicit Format Control for Large Language Model Generation

Yiqun Yao, Wenjia Ma, Xuezhi Fang, Xin Jiang, Xiang Li, Xuying Meng, Peng Han, Jing Li, Aixin Sun, Yequan Wang

arXiv:2408.04392v12.72 citationsh-index: 63Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of flexible format control in LLMs for applications requiring diverse outputs, representing an incremental improvement over existing constrained methods.

The paper tackles the problem of controlling output formats in large language models for open-domain applications, introducing a framework that uses one-shot QA pairs and a dataset collection method to improve format control without degrading quality, achieving enhanced performance on a new benchmark.

Controlling the format of outputs generated by large language models (LLMs) is a critical functionality in various applications. Current methods typically employ constrained decoding with rule-based automata or fine-tuning with manually crafted format instructions, both of which struggle with open-domain format requirements. To address this limitation, we introduce a novel framework for controlled generation in LLMs, leveraging user-provided, one-shot QA pairs. This study investigates LLMs' capabilities to follow open-domain, one-shot constraints and replicate the format of the example answers. We observe that this is a non-trivial problem for current LLMs. We also develop a dataset collection methodology for supervised fine-tuning that enhances the open-domain format control of LLMs without degrading output quality, as well as a benchmark on which we evaluate both the helpfulness and format correctness of LLM outputs. The resulting datasets, named OIFC-SFT, along with the related code, will be made publicly available at https://github.com/cofe-ai/OIFC.

View on arXiv PDF Code

Similar