LGDCMay 30, 2021

Parallelized Computation and Backpropagation Under Angle-Parametrized Orthogonal Matrices

arXiv:2106.00003v1
Originality Incremental advance
AI Analysis

This addresses efficiency bottlenecks in machine learning applications like generative modeling, though it appears incremental as it builds on known graph coloring techniques.

The paper tackles the problem of accelerating learning with orthogonal matrix constraints by restructuring a sequential rotation parametrization into parallelizable blocks, achieving computation in O(n) steps and gradient computation in O(n log n) steps.

We present a methodology for parallel acceleration of learning in the presence of matrix orthogonality and unitarity constraints of interest in several branches of machine learning. We show how an apparently sequential elementary rotation parametrization can be restructured into blocks of commutative operations using a well-known tool for coloring the edges of complete graphs, in turn widely applied to schedule round-robin (all-against-all) sports tournaments. The resulting decomposition admits an algorithm to compute a fully-parametrized orthogonal matrix from its rotation parameters in $O(n)$ sequential steps and one to compute the gradient of a training loss with respect to its parameters in $O(n\log n)$ steps. We discuss parametric restrictions of interest to generative modeling and present promising performance results with a prototype GPU implementation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes