Diffusing Surrogate Dreams of Video Scenes to Predict Video Memorability
This work addresses video memorability prediction for media analysis, but it is incremental as it builds on existing tasks and methods.
The paper tackled video memorability prediction by using surrogate dream images to represent underlying concepts, achieving state-of-the-art performance in the MediaEval 2022 task.
As part of the MediaEval 2022 Predicting Video Memorability task we explore the relationship between visual memorability, the visual representation that characterises it, and the underlying concept portrayed by that visual representation. We achieve state-of-the-art memorability prediction performance with a model trained and tested exclusively on surrogate dream images, elevating concepts to the status of a cornerstone memorability feature, and finding strong evidence to suggest that the intrinsic memorability of visual content can be distilled to its underlying concept or meaning irrespective of its specific visual representational.