LGMLDec 17, 2018

On the Continuity of Rotation Representations in Neural Networks

arXiv:1812.07035v41750 citations
Originality Incremental advance
AI Analysis

This addresses a fundamental issue for researchers and practitioners in graphics and vision by providing more learnable rotation representations, though it is incremental as it builds on existing topological concepts.

The paper tackles the problem of discontinuous rotation representations in neural networks, showing that common representations like quaternions are discontinuous in low dimensions, and proposes continuous representations in higher dimensions that outperform discontinuous ones in tasks like 3D point cloud rotation estimation and inverse kinematics.

In neural networks, it is often desirable to work with various representations of the same space. For example, 3D rotations can be represented with quaternions or Euler angles. In this paper, we advance a definition of a continuous representation, which can be helpful for training deep neural networks. We relate this to topological concepts such as homeomorphism and embedding. We then investigate what are continuous and discontinuous representations for 2D, 3D, and n-dimensional rotations. We demonstrate that for 3D rotations, all representations are discontinuous in the real Euclidean spaces of four or fewer dimensions. Thus, widely used representations such as quaternions and Euler angles are discontinuous and difficult for neural networks to learn. We show that the 3D rotations have continuous representations in 5D and 6D, which are more suitable for learning. We also present continuous representations for the general case of the n-dimensional rotation group SO(n). While our main focus is on rotations, we also show that our constructions apply to other groups such as the orthogonal group and similarity transforms. We finally present empirical results, which show that our continuous rotation representations outperform discontinuous ones for several practical problems in graphics and vision, including a simple autoencoder sanity test, a rotation estimator for 3D point clouds, and an inverse kinematics solver for 3D human poses.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes