CVOct 6, 2016

Searching Scenes by Abstracting Things

Svetlana Kordumova, Jan C. van Gemert, Cees G. M. Snoek, Arnold W. M. Smeulders

arXiv:1610.01801v11.1

Originality Incremental advance

AI Analysis

This work addresses scene retrieval for computer vision applications by proposing a novel, learning-free method, though it is incremental as it builds on existing object proposal techniques.

The paper tackles scene retrieval by representing scenes as abstractions of 'things' based on simple observable properties like position and color, without object identification, and shows that this approach can retrieve scenes reasonably well without any learning.

In this paper we propose to represent a scene as an abstraction of 'things'. We start from 'things' as generated by modern object proposals, and we investigate their immediately observable properties: position, size, aspect ratio and color, and those only. Where the recent successes and excitement of the field lie in object identification, we represent the scene composition independent of object identities. We make three contributions in this work. First, we study simple observable properties of 'things', and call it things syntax. Second, we propose translating the things syntax in linguistic abstract statements and study their descriptive effect to retrieve scenes. Thirdly, we propose querying of scenes with abstract block illustrations and study their effectiveness to discriminate among different types of scenes. The benefit of abstract statements and block illustrations is that we generate them directly from the images, without any learning beforehand as in the standard attribute learning. Surprisingly, we show that even though we use the simplest of features from 'things' layout and no learning at all, we can still retrieve scenes reasonably well.

View on arXiv PDF

Similar