QUANT-PHLGApr 20, 2025

Quantum-Enhanced Weight Optimization for Neural Networks Using Grover's Algorithm

arXiv:2504.14568v15 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses the optimization bottlenecks in neural network training for researchers and practitioners, offering a scalable and practical quantum approach, though it is incremental as it builds on existing quantum search techniques.

The authors tackled the problem of optimizing neural network weights by proposing a quantum-enhanced method using Grover's algorithm, which avoids gradient-based issues and achieved a 58.75% reduction in test loss and 35.25% improvement in test accuracy on small datasets, with scalability demonstrated by a 97.7% mean accuracy on a 3-hidden-layer network.

The main approach to hybrid quantum-classical neural networks (QNN) is employing quantum computing to build a neural network (NN) that has quantum features, which is then optimized classically. Here, we propose a different strategy: to use quantum computing in order to optimize the weights of a classical NN. As such, we design an instance of Grover's quantum search algorithm to accelerate the search for the optimal parameters of an NN during the training process, a task traditionally performed using the backpropagation algorithm with the gradient descent method. Indeed, gradient descent has issues such as exploding gradient, vanishing gradient, or convexity problem. Other methods tried to address such issues with strategies like genetic searches, but they carry additional problems like convergence consistency. Our original method avoids these issues -- because it does not calculate gradients -- and capitalizes on classical architectures' robustness and Grover's quadratic speedup in high-dimensional search spaces to significantly reduce test loss (58.75%) and improve test accuracy (35.25%), compared to classical NN weight optimization, on small datasets. Unlike most QNNs that are trained on small datasets only, our method is also scalable, as it allows the optimization of deep networks; for an NN with 3 hidden layers, trained on the Digits dataset from scikit-learn, we obtained a mean accuracy of 97.7%. Moreover, our method requires a much smaller number of qubits compared to other QNN approaches, making it very practical for near-future quantum computers that will still deliver a limited number of logical qubits.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes