CVJul 2, 2016

Active Object Localization in Visual Situations

Max H. Quinn, Anthony D. Rhodes, Melanie Mitchell

arXiv:1607.00548v13.87 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of object localization in abstract visual scenarios for computer vision applications, representing an incremental improvement with domain-specific focus.

The paper tackles the problem of actively localizing objects in visual situations by combining given and learned knowledge of spatial and semantic structures, and demonstrates strong benefits in efficiency compared to several baselines.

We describe a method for performing active localization of objects in instances of visual situations. A visual situation is an abstract concept---e.g., "a boxing match", "a birthday party", "walking the dog", "waiting for a bus"---whose image instantiations are linked more by their common spatial and semantic structure than by low-level visual similarity. Our system combines given and learned knowledge of the structure of a particular situation, and adapts that knowledge to a new situation instance as it actively searches for objects. More specifically, the system learns a set of probability distributions describing spatial and other relationships among relevant objects. The system uses those distributions to iteratively sample object proposals on a test image, but also continually uses information from those object proposals to adaptively modify the distributions based on what the system has detected. We test our approach's ability to efficiently localize objects, using a situation-specific image dataset created by our group. We compare the results with several baselines and variations on our method, and demonstrate the strong benefit of using situation knowledge and active context-driven localization. Finally, we contrast our method with several other approaches that use context as well as active search for object localization in images.

View on arXiv PDF

Similar