LGApr 22, 2017

Batch-Expansion Training: An Efficient Optimization Framework

Michał Dereziński, Dhruv Mahajan, S. Sathiya Keerthi, S. V. N. Vishwanathan, Markus Weimer

arXiv:1704.06731v32.66 citations

Originality Incremental advance

AI Analysis

This addresses resource efficiency in distributed and disk-constrained settings for machine learning practitioners, but it is incremental as it builds on existing batch optimizers.

The paper tackles the problem of inefficient data access in optimization by proposing Batch-Expansion Training (BET), a framework that runs batch optimizers on a gradually expanding dataset, achieving an optimal O(1/ε) data-access convergence rate for strongly convex objectives and showing better performance in experiments.

We propose Batch-Expansion Training (BET), a framework for running a batch optimizer on a gradually expanding dataset. As opposed to stochastic approaches, batches do not need to be resampled i.i.d. at every iteration, thus making BET more resource efficient in a distributed setting, and when disk-access is constrained. Moreover, BET can be easily paired with most batch optimizers, does not require any parameter-tuning, and compares favorably to existing stochastic and batch methods. We show that when the batch size grows exponentially with the number of outer iterations, BET achieves optimal $O(1/ε)$ data-access convergence rate for strongly convex objectives. Experiments in parallel and distributed settings show that BET performs better than standard batch and stochastic approaches.

View on arXiv PDF

Similar