LGMar 19, 2013

Large-Scale Learning with Less RAM via Randomization

arXiv:1303.4664v125 citations
Originality Incremental advance
AI Analysis

This addresses memory constraints for practitioners deploying large-scale machine learning models, though it is incremental as it builds on existing online learning methods.

The paper tackles the problem of high memory usage in large-scale online learning by using randomized rounding to project weight vectors onto coarse discrete sets, reducing RAM usage by over 50% during training and up to 95% for predictions with minimal accuracy loss.

We reduce the memory footprint of popular large-scale online learning methods by projecting our weight vector onto a coarse discrete set using randomized rounding. Compared to standard 32-bit float encodings, this reduces RAM usage by more than 50% during training and by up to 95% when making predictions from a fixed model, with almost no loss in accuracy. We also show that randomized counting can be used to implement per-coordinate learning rates, improving model quality with little additional RAM. We prove these memory-saving methods achieve regret guarantees similar to their exact variants. Empirical evaluation confirms excellent performance, dominating standard approaches across memory versus accuracy tradeoffs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes