MLJun 1, 2017

Efficient learning with robust gradient descent

arXiv:1706.00182v328 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of robust learning for tasks with imperfect data, presenting an incremental improvement over standard empirical risk minimization.

The paper tackles the problem of learning efficiently with noisy or heavy-tailed data by introducing a robust gradient approximation method, showing that it generalizes better and uses fewer resources in both simulations and real-world benchmarks.

Minimizing the empirical risk is a popular training strategy, but for learning tasks where the data may be noisy or heavy-tailed, one may require many observations in order to generalize well. To achieve better performance under less stringent requirements, we introduce a procedure which constructs a robust approximation of the risk gradient for use in an iterative learning routine. Using high-probability bounds on the excess risk of this algorithm, we show that our update does not deviate far from the ideal gradient-based update. Empirical tests using both controlled simulations and real-world benchmark data show that in diverse settings, the proposed procedure can learn more efficiently, using less resources (iterations and observations) while generalizing better.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes