Affine Normal Directions via Log-Determinant Geometry: Scalable Computation under Sparse Polynomial Structure

Yi-Shuai Niu, Artan Sheshmani, Shing-Tung Yau

arXiv:2604.0116396.52 citations

AI Analysis

This provides a practical computational method for affine normal evaluation in large-scale structured optimization, though it is incremental as it builds on existing affine geometry concepts.

The paper tackles the computational bottleneck in computing affine normal directions, which require expensive third-order derivatives and Hessian inversions, by showing these directions can be exactly reduced to second-order operations using log-determinant gradients. This leads to scalable matrix-free algorithms with near-linear complexity for sparse polynomial objectives, achieving substantial runtime improvements while maintaining machine precision.

Affine normal directions provide intrinsic affine-invariant descent directions derived from the geometry of level sets. Their practical use, however, has long been hindered by the need to evaluate third-order derivatives and invert tangent Hessians, which becomes computationally prohibitive in high dimensions. In this paper, we show that affine normal computation admits an exact reduction to second-order structure: the classical third-order contraction term is precisely the gradient of the log-determinant of the tangent Hessian. This identity replaces explicit third-order tensor contraction by a matrix-free formulation based on tangent linear solves, Hessian-vector products, and log-determinant gradient evaluation. Building on this reduction, we develop exact and stochastic matrix-free procedures for affine normal evaluation. For sparse polynomial objectives, the algebraic closure of derivatives further yields efficient sparse kernels for gradients, Hessian-vector products, and directional third-order contractions, leading to scalable implementations whose cost is governed by the sparsity structure of the polynomial representation. We establish end-to-end complexity bounds showing near-linear scaling with respect to the relevant sparsity scale under fixed stochastic and Krylov budgets. Numerical experiments confirm that the proposed MF-LogDet formulation reproduces the original autodifferentiation-based affine normal direction to near machine precision, delivers substantial runtime improvements in moderate and high dimensions, and exhibits empirical near-linear scaling in both dimension and sparsity. These results provide a practical computational route for affine normal evaluation and reveal a new connection between affine differential geometry, log-determinant curvature, and large-scale structured optimization.

View on arXiv PDF

Similar