CVOct 22, 2017

Deep Cropping via Attention Box Prediction and Aesthetics Assessment

arXiv:1710.08014v1107 citations
Originality Incremental advance
AI Analysis

This addresses automated photo cropping for users needing efficient aesthetic enhancement, but it is incremental as it builds on existing deep learning methods.

The paper tackles photo cropping by combining attention box prediction and aesthetics assessment in a deep learning cascade, achieving competitive results and a processing speed of 5 fps.

We model the photo cropping problem as a cascade of attention box regression and aesthetic quality classification, based on deep learning. A neural network is designed that has two branches for predicting attention bounding box and analyzing aesthetics, respectively. The predicted attention box is treated as an initial crop window where a set of cropping candidates are generated around it, without missing important information. Then, aesthetics assessment is employed to select the final crop as the one with the best aesthetic quality. With our network, cropping candidates share features within full-image convolutional feature maps, thus avoiding repeated feature computation and leading to higher computation efficiency. Via leveraging rich data for attention prediction and aesthetics assessment, the proposed method produces high-quality cropping results, even with the limited availability of training data for photo cropping. The experimental results demonstrate the competitive results and fast processing speed (5 fps with all steps).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes