CVCLLGMay 20, 2019

Image Captioning based on Deep Learning Methods: A Survey

arXiv:1905.08110v12 citations
Originality Synthesis-oriented
AI Analysis

It provides a comprehensive overview for researchers and practitioners in AI, but is incremental as it summarizes existing work without introducing new methods.

This paper surveys recent advances in image captioning using deep learning methods, covering encoder-decoder structures, improvements in encoders and decoders, and other enhancements, while also discussing future research directions.

Image captioning is a challenging task and attracting more and more attention in the field of Artificial Intelligence, and which can be applied to efficient image retrieval, intelligent blind guidance and human-computer interaction, etc. In this paper, we present a survey on advances in image captioning based on Deep Learning methods, including Encoder-Decoder structure, improved methods in Encoder, improved methods in Decoder, and other improvements. Furthermore, we discussed future research directions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes