CVJul 4, 2019

Deep Saliency Models : The Quest For The Loss Function

arXiv:1907.02336v121 citations
Originality Incremental advance
AI Analysis

This work addresses the often-neglected problem of loss function selection in deep saliency prediction for computer vision researchers, offering incremental improvements through novel combinations.

The paper investigates the impact of loss function choice on deep saliency models, showing that modifying loss functions can significantly improve or degrade results, and introduces new loss functions and a linear combination approach that leads to significant performance improvements on various datasets and architectures.

Recent advances in deep learning have pushed the performances of visual saliency models way further than it has ever been. Numerous models in the literature present new ways to design neural networks, to arrange gaze pattern data, or to extract as much high and low-level image features as possible in order to create the best saliency representation. However, one key part of a typical deep learning model is often neglected: the choice of the loss function. In this work, we explore some of the most popular loss functions that are used in deep saliency models. We demonstrate that on a fixed network architecture, modifying the loss function can significantly improve (or depreciate) the results, hence emphasizing the importance of the choice of the loss function when designing a model. We also introduce new loss functions that have never been used for saliency prediction to our knowledge. And finally, we show that a linear combination of several well-chosen loss functions leads to significant improvements in performances on different datasets as well as on a different network architecture, hence demonstrating the robustness of a combined metric.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes