LGJun 3, 2023

Automated Architecture Synthesis for Arbitrarily Structured Neural Networks

arXiv:2306.02157v4h-index: 4

Originality Highly original

AI Analysis

This addresses the problem of restricted network collaboration and capability in AI/ML by enabling more flexible and biologically-inspired neural architectures.

The paper tackles the limitation of traditional neural networks with preset or DAG structures by proposing a framework that learns arbitrary graph structures during training, inspired by biological neural systems, resulting in reduced overfitting and improved efficiency through parallel computing.

This paper offers a new perspective on Artificial Neural Networks (ANNs) architecture. Traditional ANNs commonly use tree-like or DAG structures for simplicity, which can be preset or determined by Neural Architecture Search (NAS). Yet, these structures restrict network collaboration and capability due to the absence of horizontal and backward communication. Biological neural systems, however, feature billions of neural units with highly complex connections, allowing each biological neuron to connect with others based on specific situations. Inspired by biological systems, we propose a novel framework that learns to construct arbitrary graph structures during training and introduce the concept of Neural Modules for organizing neural units, which facilitates communication between any nodes and collaboration among modules. Unlike traditional NAS methods that rely on DAG search spaces, our framework learns from complete graphs, enabling free communication between neurons akin to biological neural networks. Furthermore, we present a method to compute these structures and a regularization technique that organizes them into multiple independent, balanced neural modules. This approach reduces overfitting and improves efficiency through parallel computing. Overall, our method allows ANNs to learn effective arbitrary structures similar to biological ones. It is adaptable to various tasks and compatible across different scenarios, with experimental results demonstrating its potential.

View on arXiv PDF

Similar