AINENov 28, 2017

Gradual Tuning: a better way of Fine Tuning the parameters of a Deep Neural Network

arXiv:1711.10177v11 citations
Originality Incremental advance
AI Analysis

This addresses catastrophic forgetting in transfer learning, an incremental improvement over standard fine-tuning.

The paper tackles catastrophic forgetting in neural network fine-tuning by introducing Gradual Tuning, which modifies progressively more parameters when adapting to a new task. Results show it significantly reduces forgetting of the initial task while maintaining comparable or better performance on the new task.

In this paper we present an alternative strategy for fine-tuning the parameters of a network. We named the technique Gradual Tuning. Once trained on a first task, the network is fine-tuned on a second task by modifying a progressively larger set of the network's parameters. We test Gradual Tuning on different transfer learning tasks, using networks of different sizes trained with different regularization techniques. The result shows that compared to the usual fine tuning, our approach significantly reduces catastrophic forgetting of the initial task, while still retaining comparable if not better performance on the new task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes