CLAIOct 7, 2022

Visualize Before You Write: Imagination-Guided Open-Ended Text Generation

arXiv:2210.03765v4290 citationsh-index: 63
Originality Incremental advance
AI Analysis

This work addresses the challenge of enhancing creativity and coherence in text generation for applications like writing assistance, though it is incremental as it builds on existing text-to-image and language model technologies.

The paper tackles the problem of open-ended text generation by using machine-generated images to guide language models, resulting in coherent and informative text with minor degeneration across tasks like story generation and text completion.

Recent advances in text-to-image synthesis make it possible to visualize machine imaginations for a given context. On the other hand, when generating text, human writers are gifted at creative visualization, which enhances their writings by forming imaginations as blueprints before putting down the stories in words. Inspired by such a cognitive process, we ask the natural question of whether we can endow machines with the same ability to utilize visual information and construct a general picture of the context to guide text generation. In this work, we propose iNLG that uses machine-generated images to guide language models in open-ended text generation. The experiments and analyses demonstrate the effectiveness of iNLG on open-ended text generation tasks, including text completion, story generation, and concept-to-text generation in both few-shot and full-data scenarios. Both automatic metrics and human evaluations verify that the text snippets generated by our iNLG are coherent and informative while displaying minor degeneration.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes