CLSep 11, 2019

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling

arXiv:1909.05316v273 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving story quality in visual storytelling for applications like AI-generated content, though it is incremental as it builds on existing reinforcement learning methods.

The paper tackled the problem of generating high-quality visual stories by proposing a reinforcement learning framework with reward functions based on relevance, coherence, and expressiveness, achieving better performance than state-of-the-art baselines on both traditional and new criteria in experiments on the VIST dataset.

Previous storytelling approaches mostly focused on optimizing traditional metrics such as BLEU, ROUGE and CIDEr. In this paper, we re-examine this problem from a different angle, by looking deep into what defines a realistically-natural and topically-coherent story. To this end, we propose three assessment criteria: relevance, coherence and expressiveness, which we observe through empirical analysis could constitute a "high-quality" story to the human eye. Following this quality guideline, we propose a reinforcement learning framework, ReCo-RL, with reward functions designed to capture the essence of these quality criteria. Experiments on the Visual Storytelling Dataset (VIST) with both automatic and human evaluations demonstrate that our ReCo-RL model achieves better performance than state-of-the-art baselines on both traditional metrics and the proposed new criteria.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes