LGDec 24, 2020

Learning with Retrospection

arXiv:2012.13098v120 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of discarding past epoch information during DNN training, aiming to improve accuracy, calibration, and robustness for practitioners using deep learning models. This is an incremental improvement to existing training procedures.

This paper proposes Learning with Retrospection (LWR), a training framework for deep neural networks that utilizes information from past epochs to guide subsequent training. LWR improves accuracies, calibration, and robustness of DNNs without adding network parameters or inference cost, with only negligible training overhead.

Deep neural networks have been successfully deployed in various domains of artificial intelligence, including computer vision and natural language processing. We observe that the current standard procedure for training DNNs discards all the learned information in the past epochs except the current learned weights. An interesting question is: is this discarded information indeed useless? We argue that the discarded information can benefit the subsequent training. In this paper, we propose learning with retrospection (LWR) which makes use of the learned information in the past epochs to guide the subsequent training. LWR is a simple yet effective training framework to improve accuracies, calibration, and robustness of DNNs without introducing any additional network parameters or inference cost, only with a negligible training overhead. Extensive experiments on several benchmark datasets demonstrate the superiority of LWR for training DNNs.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes