CLMar 19

HumorGen: Cognitive Synergy for Humor Generation in Large Language Models via Persona-Based Distillation

arXiv:2604.0962920.61 citationsh-index: 5

Predicted impact top 77% in CL · last 90 daysOriginality Incremental advance

AI Analysis

It addresses the challenge of generating humor for LLM applications, with incremental improvements in data curation over existing methods.

The paper tackles humor generation in LLMs by introducing a Cognitive Synergy Framework using cognitive personas to create a dataset, fine-tuning a 7B model that outperforms larger baselines and matches proprietary models.

Humor generation poses a significant challenge for Large Language Models (LLMs), because their standard training objective - predicting the most likely next word - inherently conflicts with the surprise and incongruity needed for comedy. To bridge this gap, we introduce the Cognitive Synergy Framework, a theoretically grounded methodology for generating high-quality humor data inspired by psychological theories of humor. Utilizing a Mixture-of-Thought (MoT) approach, we deploy six cognitive personas (e.g., The Absurdist, The Cynic) to synthesize diverse comedic perspectives for a given prompt. This framework creates a theoretically grounded dataset, which we use to fine-tune a 7B-parameter student model. We compare Direct Preference Optimization (DPO) and a novel Offline Group Relative Policy Optimization (O-GRPO); our 7B model significantly outperforms larger instruction-tuned baselines and achieves performance competitive with state-of-the-art proprietary models. We find that cognitive-driven data curation is far more critical than alignment algorithms or model scale for humor generation. Code and data will be available upon publication.

View on arXiv PDF

Similar