LGETIVOPTICSMar 20, 2022

Hybrid training of optical neural networks

arXiv:2203.11207v268 citationsh-index: 46
AI Analysis

This work addresses a critical bottleneck for researchers developing energy-efficient optical hardware for machine learning, though it is incremental as it builds on existing training methods.

The paper tackles the reality gap in optical neural networks caused by physical imperfections by introducing hybrid training, where weight matrices are trained using optically computed neuron activations, and demonstrates robustness against static noise in comparative studies.

Optical neural networks are emerging as a promising type of machine learning hardware capable of energy-efficient, parallel computation. Today's optical neural networks are mainly developed to perform optical inference after in silico training on digital simulators. However, various physical imperfections that cannot be accurately modelled may lead to the notorious reality gap between the digital simulator and the physical system. To address this challenge, we demonstrate hybrid training of optical neural networks where the weight matrix is trained with neuron activation functions computed optically via forward propagation through the network. We examine the efficacy of hybrid training with three different networks: an optical linear classifier, a hybrid opto-electronic network, and a complex-valued optical network. We perform a comparative study to in silico training, and our results show that hybrid training is robust against different kinds of static noise. Our platform-agnostic hybrid training scheme can be applied to a wide variety of optical neural networks, and this work paves the way towards advanced all-optical training in machine intelligence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes