LGMar 5

Preserving Continuous Symmetry in Discrete Spaces: Geometric-Aware Quantization for SO(3)-Equivariant GNNs

Haoyu Zhou, Ping Xue, Hao Zhang, Tianfan Fu

arXiv:2603.05343v1

Originality Highly original

AI Analysis

This work addresses the high computational cost and memory bottlenecks of equivariant GNNs, which are crucial for physically consistent molecular simulations, by enabling efficient low-bit quantization while preserving essential symmetries.

This paper introduces Geometric-Aware Quantization (GAQ) to enable low-bit quantization for SO(3)-equivariant GNNs without destroying their symmetry. GAQ achieves FP32 accuracy (9.31 meV vs. 23.20 meV) with W4A8 models, reduces Local Equivariance Error by over 30x, and provides 2.39x inference speedup and 4x memory reduction.

Equivariant Graph Neural Networks (GNNs) are essential for physically consistent molecular simulations but suffer from high computational costs and memory bottlenecks, especially with high-order representations. While low-bit quantization offers a solution, applying it naively to rotation-sensitive features destroys the SO(3)-equivariant structure, leading to significant errors and violations of conservation laws. To address this issue, in this work, we propose a Geometric-Aware Quantization (GAQ) framework that compresses and accelerates equivariant models while rigorously preserving continuous symmetry in discrete spaces. Our approach introduces three key contributions: (1) a Magnitude-Direction Decoupled Quantization (MDDQ) scheme that separates invariant lengths from equivariant orientations to maintain geometric fidelity; (2) a symmetry-aware training strategy that treats scalar and vector features with distinct quantization schedules; and (3) a robust attention normalization mechanism to stabilize gradients in low-bit regimes. Experiments on the rMD17 benchmark demonstrate that our W4A8 models match the accuracy of FP32 baselines (9.31 meV vs. 23.20 meV) while reducing Local Equivariance Error (LEE) by over 30x compared to naive quantization. On consumer hardware, GAQ achieves 2.39x inference speedup and 4x memory reduction, enabling stable, energy-conserving molecular dynamics simulations for nanosecond timescales.

View on arXiv PDF

Similar