CVAug 28, 2017

Automatic Discovery and Geotagging of Objects from Street View Imagery

Vladimir A. Krylov, Eamonn Kenny, Rozenn Dahyot

arXiv:1708.08417v299 citations

Originality Incremental advance

AI Analysis

This provides accurate object mapping for applications such as autonomous navigation and urban planning, though it is incremental as it builds on existing neural network and triangulation methods.

The paper tackles the problem of automatically detecting and geolocating stationary objects like traffic lights and telegraph poles from street view imagery, achieving high recall rates and GPS accuracy within 2 meters.

Many applications such as autonomous navigation, urban planning and asset monitoring, rely on the availability of accurate information about objects and their geolocations. In this paper we propose to automatically detect and compute the GPS coordinates of recurring stationary objects of interest using street view imagery. Our processing pipeline relies on two fully convolutional neural networks: the first segments objects in the images while the second estimates their distance from the camera. To geolocate all the detected objects coherently we propose a novel custom Markov Random Field model to perform objects triangulation. The novelty of the resulting pipeline is the combined use of monocular depth estimation and triangulation to enable automatic mapping of complex scenes with multiple visually similar objects of interest. We validate experimentally the effectiveness of our approach on two object classes: traffic lights and telegraph poles. The experiments report high object recall rates and GPS accuracy within 2 meters, which is comparable with the precision of single-frequency GPS receivers.

View on arXiv PDF

Similar