CV LG OCMar 9, 2022

Resource-Efficient Invariant Networks: Exponential Gains by Unrolled Optimization

Sam Buchanan, Jingkai Yan, Ellie Haber, John Wright

arXiv:2203.05006v13.73 citationsh-index: 8Has Code

Originality Highly original

AI Analysis

This addresses the problem of handling natural variabilities like pose changes in vision systems, offering a novel method for known bottlenecks with potential broad impact.

The paper tackles the challenge of building invariant vision systems that scale exponentially with transformation dimensions by proposing a new optimization-based primitive, which provides provably more efficient high-dimensional exploration than sampling and demonstrates utility in an efficient invariant network for hierarchical object detection.

Achieving invariance to nuisance transformations is a fundamental challenge in the construction of robust and reliable vision systems. Existing approaches to invariance scale exponentially with the dimension of the family of transformations, making them unable to cope with natural variabilities in visual data such as changes in pose and perspective. We identify a common limitation of these approaches--they rely on sampling to traverse the high-dimensional space of transformations--and propose a new computational primitive for building invariant networks based instead on optimization, which in many scenarios provides a provably more efficient method for high-dimensional exploration than sampling. We provide empirical and theoretical corroboration of the efficiency gains and soundness of our proposed method, and demonstrate its utility in constructing an efficient invariant network for a simple hierarchical object detection task when combined with unrolled optimization. Code for our networks and experiments is available at https://github.com/sdbuch/refine.

View on arXiv PDF Code

Similar