CLAIMay 20, 2025

Creative Preference Optimization

arXiv:2505.14442v217 citationsh-index: 9EMNLP
Originality Highly original
AI Analysis

This addresses the challenge of enhancing multifaceted creativity in LLMs for content generation applications, representing a novel methodological advance rather than an incremental improvement.

The paper tackled the problem of limited creativity in Large Language Models by proposing Creative Preference Optimization (CrPO), which injects multiple creativity dimensions into preference optimization, resulting in models that outperform GPT-4o on automated and human evaluations by producing more novel, diverse, and surprising generations while maintaining quality.

While Large Language Models (LLMs) have demonstrated impressive performance across natural language generation tasks, their ability to generate truly creative content-characterized by novelty, diversity, surprise, and quality-remains limited. Existing methods for enhancing LLM creativity often focus narrowly on diversity or specific tasks, failing to address creativity's multifaceted nature in a generalizable way. In this work, we propose Creative Preference Optimization (CrPO), a novel alignment method that injects signals from multiple creativity dimensions into the preference optimization objective in a modular fashion. We train and evaluate creativity-augmented versions of several models using CrPO and MuCE, a new large-scale human preference dataset spanning over 200,000 human-generated responses and ratings from more than 30 psychological creativity assessments. Our models outperform strong baselines, including GPT-4o, on both automated and human evaluations, producing more novel, diverse, and surprising generations while maintaining high output quality. Additional evaluations on NoveltyBench further confirm the generalizability of our approach. Together, our results demonstrate that directly optimizing for creativity within preference frameworks is a promising direction for advancing the creative capabilities of LLMs without compromising output quality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes