CL AI IR LGDec 2, 2021

LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Shumin Deng, Jiacheng Yang, Hongbin Ye, Chuanqi Tan, Mosha Chen, Songfang Huang, Fei Huang, Huajun Chen, Ningyu Zhang

arXiv:2112.01404v31.29 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of data-hungry methods for logical knowledge-conditioned text generation, making it more applicable to real-world scenarios with limited data.

The paper tackles the problem of generating text from structured data with logical knowledge in few-shot settings, achieving better performance than baselines using only 20 to 100 seed logical forms through self-training and pseudo-form sampling.

Natural language generation from structured data mainly focuses on surface-level descriptions, suffering from uncontrollable content selection and low fidelity. Previous works leverage logical forms to facilitate logical knowledge-conditioned text generation. Though achieving remarkable progress, they are data-hungry, which makes the adoption for real-world applications challenging with limited data. To this end, this paper proposes a unified framework for logical knowledge-conditioned text generation in the few-shot setting. With only a few seeds logical forms (e.g., 20/100 shot), our approach leverages self-training and samples pseudo logical forms based on content and structure consistency. Experimental results demonstrate that our approach can obtain better few-shot performance than baselines.

View on arXiv PDF

Similar