CVAug 7, 2017

Training Deep Networks to be Spatially Sensitive

arXiv:1708.02212v14 citations
Originality Highly original
AI Analysis

This work addresses the need for spatially aware learning objectives in computer vision tasks, offering a practical solution for improving model efficiency and accuracy without extensive pre/post-processing.

The paper tackled the problem of training deep networks for spatially sensitive tasks like saliency prediction by proposing a differentiable and efficient approximation of the Weighted F-measure, which incorporates spatial relationships into the learning objective. This approach matched or improved performance on several tasks compared to prior state-of-the-art methods, with significant gains in some cases using the weighted F-measure.

In many computer vision tasks, for example saliency prediction or semantic segmentation, the desired output is a foreground map that predicts pixels where some criteria is satisfied. Despite the inherently spatial nature of this task commonly used learning objectives do not incorporate the spatial relationships between misclassified pixels and the underlying ground truth. The Weighted F-measure, a recently proposed evaluation metric, does reweight errors spatially, and has been shown to closely correlate with human evaluation of quality, and stably rank predictions with respect to noisy ground truths (such as a sloppy human annotator might generate). However it suffers from computational complexity which makes it intractable as an optimization objective for gradient descent, which must be evaluated thousands or millions of times while learning a model's parameters. We propose a differentiable and efficient approximation of this metric. By incorporating spatial information into the objective we can use a simpler model than competing methods without sacrificing accuracy, resulting in faster inference speeds and alleviating the need for pre/post-processing. We match (or improve) performance on several tasks compared to prior state of the art by traditional metrics, and in many cases significantly improve performance by the weighted F-measure.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes