CVSep 2, 2020

Structure-Aware Generation Network for Recipe Generation from Images

arXiv:2009.00944v110.636 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the need for automated recipe generation for social media and food applications, but it is an incremental improvement over existing image captioning methods.

The paper tackles the problem of automatically generating cooking instructions from food images and ingredients, proposing a Structure-aware Generation Network (SGN) that achieves state-of-the-art performance on the Recipe1M dataset.

Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task. However, compared with image captioning datasets, the target recipes are long-length paragraphs and do not have annotations on structure information. To address the above limitations, we propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the inferred tree structures with the recipe generation procedure. Our proposed model can produce high-quality and coherent recipes, and achieve the state-of-the-art performance on the benchmark Recipe1M dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes