CV LGMay 5

Raising the Ceiling: Better Empirical Fixation Densities for Saliency Benchmarking

Susmit Agrawal, Jannis Hollman, Matthias Kümmerer

arXiv:2605.0388529.7

Predicted impact top 86% in CV · last 90 daysOriginality Incremental advance

AI Analysis

For researchers benchmarking saliency models, this work provides a more reliable method for estimating fixation densities, which directly impacts leaderboard rankings and failure case analyses.

The paper proposes a principled mixture model for estimating empirical fixation densities from eye-tracking data, achieving median per-image gains of 5-15% in log-likelihood and up to 2 percentage points in AUC over the standard fixed-bandwidth KDE method. The improved estimates reveal that state-of-the-art saliency models still have significant room for improvement.

Empirical fixation densities, spatial distributions estimated from human eye-tracking data, are foundational to saliency benchmarking. They directly shape benchmark conclusions, leaderboard rankings, failure case analyses, and scientific claims about human visual behavior. Yet the standard estimation method, fixed-bandwidth isotropic Gaussian KDE, has gone essentially unchanged for decades. This matters now more than ever: as the field shifts toward sample-level evaluation (failure case analysis, inverse benchmarking, per-image model comparison), reliable per-image density estimates become critical. We propose a principled mixture model that combines an adaptive-bandwidth KDE based on Abramson's method, center bias and uniform components, and a state-of-the-art saliency model, to capture different spatial and semantic types of interobserver consistency, and optimize all parameters per image via leave-one-subject-out cross-validation. Our method yields substantially higher interobserver consistency estimates across multiple benchmarks, with median per-image gains of 5-15% in log-likelihood and up to 2 percentage points in AUC. For the most affected images -- precisely those most relevant to failure case analysis -- improvements exceed 25%. We leverage these improved estimates to identify and analyze remaining failure cases of state-of-the-art saliency models, demonstrating that significant headroom for model improvement remains. More broadly, our findings highlight that empirical fixation densities should not be treated as fixed ground truths but as evolving estimates that improve with better methodology.

View on arXiv PDF

Similar