CLApr 5, 2024

Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation

arXiv:2404.04232v228 citationsh-index: 8ACL
AI Analysis

This addresses a crucial gap for researchers and practitioners in text generation by providing a benchmark and method to enhance generalization, though it is incremental as it builds on existing MCTG approaches.

The paper tackles the lack of a benchmark for compositional generalization in multi-aspect controllable text generation by introducing CompMCTG, and proposes Meta-MCTG, a meta-learning framework that improves performance by up to 3.64% in most cases.

Compositional generalization, representing the model's ability to generate text with new attribute combinations obtained by recombining single attributes from the training data, is a crucial property for multi-aspect controllable text generation (MCTG) methods. Nonetheless, a comprehensive compositional generalization evaluation benchmark of MCTG is still lacking. We propose CompMCTG, a benchmark encompassing diverse multi-aspect labeled datasets and a crafted three-dimensional evaluation protocol, to holistically evaluate the compositional generalization of MCTG approaches. We observe that existing MCTG works generally confront a noticeable performance drop in compositional testing. To mitigate this issue, we introduce Meta-MCTG, a training framework incorporating meta-learning, where we enable models to learn how to generalize by simulating compositional generalization scenarios in the training phase. We demonstrate the effectiveness of Meta-MCTG through achieving obvious improvement (by at most 3.64%) for compositional testing performance in 94.4% cases.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes