CV CL Sep 16, 2022

Belief Revision based Caption Re-ranker with Visual Semantic Information

Ahmed Sabir, Francesc Moreno-Noguer, Pranava Madhyastha, Lluís Padró

arXiv:2209.08163v148.6580 citations | h-index: 48 | Has Code

Originality: Incremental advance

AI Analysis

This work addresses the need for better caption accuracy in image-captioning systems, but it is incremental as it builds on existing re-ranking and belief revision methods.

The paper proposes a re-ranking approach for image-caption generation systems: visual-semantic measures and the Belief Revision framework are used to select the best caption from the system's candidates, improving performance without additional training.

In this work, we focus on improving the captions generated by image-caption generation systems. We propose a novel re-ranking approach that leverages visual-semantic measures to identify the ideal caption that maximally captures the visual information in the image. Our re-ranker utilizes the Belief Revision framework (Blok et al., 2003) to calibrate the original likelihood of the top-n captions by explicitly exploiting the semantic relatedness between the depicted caption and the visual context. Our experiments demonstrate the utility of our approach, where we observe that our re-ranker can enhance the performance of a typical image-captioning system without the necessity of any additional training or fine-tuning.
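The abstract does not state the revision rule itself. The following is a minimal sketch, assuming the similarity-based form commonly attributed to Blok et al. (2003), in which a caption likelihood P(w) is revised to P(w)^α with α = ((1 − sim) / (1 + sim))^(1 − P(c)), where sim is the semantic relatedness between caption and visual context and P(c) is the confidence in that context. The function names `revise` and `rerank` and all sample numbers are hypothetical and serve only as illustration; they are not taken from the paper or its code.

```python
from typing import List, Tuple

# (caption text, original likelihood, similarity to visual context, context confidence)
Candidate = Tuple[str, float, float, float]


def revise(p_caption: float, similarity: float, p_context: float) -> float:
    """Revise a caption likelihood using an assumed Blok-style belief revision rule."""
    if not 0.0 < p_caption <= 1.0:
        raise ValueError("p_caption must be in (0, 1]")
    if not 0.0 <= similarity <= 1.0:
        raise ValueError("similarity must be in [0, 1]")
    if not 0.0 <= p_context <= 1.0:
        raise ValueError("p_context must be in [0, 1]")
    # alpha is 1 when similarity is 0 (no revision) and shrinks toward 0
    # as similarity grows, which raises the revised likelihood.
    alpha = ((1.0 - similarity) / (1.0 + similarity)) ** (1.0 - p_context)
    return p_caption ** alpha


def rerank(candidates: List[Candidate]) -> List[Tuple[str, float]]:
    """Return (caption, revised likelihood) pairs sorted best-first."""
    scored = [(cap, revise(p, sim, pc)) for cap, p, sim, pc in candidates]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical top-n output of a captioning system.
candidates = [
    ("a man riding a wave", 0.20, 0.10, 0.8),
    ("a man on a surfboard", 0.15, 0.70, 0.8),
]
reranked = rerank(candidates)
```

In this toy input the second caption starts with the lower likelihood but is more related to the visual context, so the revision moves it to the top. No model is trained or fine-tuned: the re-ranker only rescales scores the captioning system already produced.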
