CV AI GTOct 21, 2017

ADA: A Game-Theoretic Perspective on Data Augmentation for Object Detection

Sima Behpour, Kris M. Kitani, Brian D. Ziebart

arXiv:1710.07735v21.71 citations

Originality Highly original

AI Analysis

This work addresses the challenge of principled data augmentation design for object detection, offering a novel method that could benefit researchers and practitioners in computer vision.

The paper tackles the problem of designing optimal data augmentation for object detection by proposing a game-theoretic approach to find adversarial perturbations of ground truth data, resulting in a 16% performance improvement on ImageNet compared to the best existing method.

The use of random perturbations of ground truth data, such as random translation or scaling of bounding boxes, is a common heuristic used for data augmentation that has been shown to prevent overfitting and improve generalization. Since the design of data augmentation is largely guided by reported best practices, it is difficult to understand if those design choices are optimal. To provide a more principled perspective, we develop a game-theoretic interpretation of data augmentation in the context of object detection. We aim to find an optimal adversarial perturbations of the ground truth data (i.e., the worst case perturbations) that forces the object bounding box predictor to learn from the hardest distribution of perturbed examples for better test-time performance. We establish that the game theoretic solution, the Nash equilibrium, provides both an optimal predictor and optimal data augmentation distribution. We show that our adversarial method of training a predictor can significantly improve test time performance for the task of object detection. On the ImageNet object detection task, our adversarial approach improves performance by over 16\% compared to the best performing data augmentation method

View on arXiv PDF

Similar