LG DCMay 30, 2021

Parallelized Computation and Backpropagation Under Angle-Parametrized Orthogonal Matrices

arXiv:2106.00003v1

Originality Incremental advance

AI Analysis

This addresses efficiency bottlenecks in machine learning applications like generative modeling, though it appears incremental as it builds on known graph coloring techniques.

The paper tackles the problem of accelerating learning with orthogonal matrix constraints by restructuring a sequential rotation parametrization into parallelizable blocks, achieving computation in O(n) steps and gradient computation in O(n log n) steps.

We present a methodology for parallel acceleration of learning in the presence of matrix orthogonality and unitarity constraints of interest in several branches of machine learning. We show how an apparently sequential elementary rotation parametrization can be restructured into blocks of commutative operations using a well-known tool for coloring the edges of complete graphs, in turn widely applied to schedule round-robin (all-against-all) sports tournaments. The resulting decomposition admits an algorithm to compute a fully-parametrized orthogonal matrix from its rotation parameters in $O(n)$ sequential steps and one to compute the gradient of a training loss with respect to its parameters in $O(n\log n)$ steps. We discuss parametric restrictions of interest to generative modeling and present promising performance results with a prototype GPU implementation.

View on arXiv PDF

Similar