LGFeb 11, 2025

MatrixKAN: Parallelized Kolmogorov-Arnold Network

arXiv:2502.07176v24 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses a computational bottleneck for researchers and practitioners using KANs, representing an incremental improvement.

The paper tackled the slow training and inference speeds of Kolmogorov-Arnold Networks (KANs) by proposing MatrixKAN, a parallelized optimization using matrix operations, which achieved speedups of approximately 40x relative to KAN.

Kolmogorov-Arnold Networks (KAN) are a new class of neural network architecture representing a promising alternative to the Multilayer Perceptron (MLP), demonstrating improved expressiveness and interpretability. However, KANs suffer from slow training and inference speeds relative to MLPs due in part to the recursive nature of the underlying B-spline calculations. This issue is particularly apparent with respect to KANs utilizing high-degree B-splines, as the number of required non-parallelizable recursions is proportional to B-spline degree. We solve this issue by proposing MatrixKAN, a novel optimization that parallelizes B-spline calculations with matrix representation and operations, thus significantly improving effective computation time for models utilizing high-degree B-splines. In this paper, we demonstrate the superior scaling of MatrixKAN's computation time relative to B-spline degree. Further, our experiments demonstrate speedups of approximately 40x relative to KAN, with significant additional speedup potential for larger datasets or higher spline degrees.

View on arXiv PDF

Similar