LGCVNEMLJun 11, 2018

Understanding Patch-Based Learning by Explaining Predictions

arXiv:1806.06926v16 citations
Originality Incremental advance
AI Analysis

This work addresses interpretability in video analysis for researchers, but it is incremental as it applies an existing technique to a new domain.

The paper tackled the problem of understanding classification decisions in deep networks for video data by applying the deep Taylor/LRP technique, revealing a 'border effect' where the classifier focuses on bordering frames, and tuning the step size improved accuracy without retraining.

Deep networks are able to learn highly predictive models of video data. Due to video length, a common strategy is to train them on small video snippets. We apply the deep Taylor / LRP technique to understand the deep network's classification decisions, and identify a "border effect": a tendency of the classifier to look mainly at the bordering frames of the input. This effect relates to the step size used to build the video snippet, which we can then tune in order to improve the classifier's accuracy without retraining the model. To our knowledge, this is the the first work to apply the deep Taylor / LRP technique on any video analyzing neural network.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes