MLMar 21, 2016

Data Augmentation via Levy Processes

Stefan Wager, William Fithian, Percy Liang

arXiv:1603.06340v14.63 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses the need for robust data augmentation methods in machine learning, offering a general framework that unifies existing techniques like Gaussian noising and dropout, but it is incremental as it builds on known concepts without introducing a new paradigm.

The paper tackles the problem of incorporating invariance properties into discriminative classifiers by proposing a data augmentation framework based on Levy processes, which generates pseudo-examples by slicing the process earlier in time and preserves the Bayes decision boundary while connecting to generative models in the limit.

If a document is about travel, we may expect that short snippets of the document should also be about travel. We introduce a general framework for incorporating these types of invariances into a discriminative classifier. The framework imagines data as being drawn from a slice of a Levy process. If we slice the Levy process at an earlier point in time, we obtain additional pseudo-examples, which can be used to train the classifier. We show that this scheme has two desirable properties: it preserves the Bayes decision boundary, and it is equivalent to fitting a generative model in the limit where we rewind time back to 0. Our construction captures popular schemes such as Gaussian feature noising and dropout training, as well as admitting new generalizations.

View on arXiv PDF Code

Similar