LGAIOPTICSDec 9, 2022

Dual adaptive training of photonic neural networks

arXiv:2212.06141v166 citationsh-index: 39
Originality Incremental advance
AI Analysis

This addresses a critical bottleneck for deploying large-scale PNNs as analog AI accelerators, though it is incremental as it builds on existing training methods.

The paper tackled the problem of systematic errors degrading performance in large-scale photonic neural networks (PNNs) by proposing dual adaptive training (DAT), which preserved classification accuracies comparable to error-free systems and outperformed state-of-the-art in situ training approaches.

Photonic neural network (PNN) is a remarkable analog artificial intelligence (AI) accelerator that computes with photons instead of electrons to feature low latency, high energy efficiency, and high parallelism. However, the existing training approaches cannot address the extensive accumulation of systematic errors in large-scale PNNs, resulting in a significant decrease in model performance in physical systems. Here, we propose dual adaptive training (DAT) that allows the PNN model to adapt to substantial systematic errors and preserves its performance during the deployment. By introducing the systematic error prediction networks with task-similarity joint optimization, DAT achieves the high similarity mapping between the PNN numerical models and physical systems and high-accurate gradient calculations during the dual backpropagation training. We validated the effectiveness of DAT by using diffractive PNNs and interference-based PNNs on image classification tasks. DAT successfully trained large-scale PNNs under major systematic errors and preserved the model classification accuracies comparable to error-free systems. The results further demonstrated its superior performance over the state-of-the-art in situ training approaches. DAT provides critical support for constructing large-scale PNNs to achieve advanced architectures and can be generalized to other types of AI systems with analog computing errors.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes