LGOCMLDec 27, 2019

SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss Functions

arXiv:1912.12355v195 citations
Originality Incremental advance
AI Analysis

This addresses the issue of slow convergence and poor loss weighting in adaptive loss functions for researchers and practitioners in deep learning, representing an incremental improvement over heuristic approaches.

The paper tackles the problem of suboptimal weighting in multi-part loss functions for neural networks, proposing SoftAdapt, a family of methods that dynamically adjusts loss weights based on live performance statistics, resulting in improved convergence and performance in tasks like image reconstruction and synthetic data generation.

Adaptive loss function formulation is an active area of research and has gained a great deal of popularity in recent years, following the success of deep learning. However, existing frameworks of adaptive loss functions often suffer from slow convergence and poor choice of weights for the loss components. Traditionally, the elements of a multi-part loss function are weighted equally or their weights are determined through heuristic approaches that yield near-optimal (or sub-optimal) results. To address this problem, we propose a family of methods, called SoftAdapt, that dynamically change function weights for multi-part loss functions based on live performance statistics of the component losses. SoftAdapt is mathematically intuitive, computationally efficient and straightforward to implement. In this paper, we present the mathematical formulation and pseudocode for SoftAdapt, along with results from applying our methods to image reconstruction (Sparse Autoencoders) and synthetic data generation (Introspective Variational Autoencoders).

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes