CVJun 25, 2020

An Analysis of SVD for Deep Rotation Estimation

arXiv:2006.14616v1120 citations
AI Analysis

This addresses the lack of a universally effective rotation representation for computer vision and robotics, though it appears incremental as it applies an existing mathematical technique to deep learning.

The paper tackled the problem of representing 3D rotations in neural networks by exploring symmetric orthogonalization via SVD, showing that simply replacing existing representations with SVD achieves state-of-the-art performance in many deep learning applications.

Symmetric orthogonalization via SVD, and closely related procedures, are well-known techniques for projecting matrices onto $O(n)$ or $SO(n)$. These tools have long been used for applications in computer vision, for example optimal 3D alignment problems solved by orthogonal Procrustes, rotation averaging, or Essential matrix decomposition. Despite its utility in different settings, SVD orthogonalization as a procedure for producing rotation matrices is typically overlooked in deep learning models, where the preferences tend toward classic representations like unit quaternions, Euler angles, and axis-angle, or more recently-introduced methods. Despite the importance of 3D rotations in computer vision and robotics, a single universally effective representation is still missing. Here, we explore the viability of SVD orthogonalization for 3D rotations in neural networks. We present a theoretical analysis that shows SVD is the natural choice for projecting onto the rotation group. Our extensive quantitative analysis shows simply replacing existing representations with the SVD orthogonalization procedure obtains state of the art performance in many deep learning applications covering both supervised and unsupervised training.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes