LGMLOct 18, 2019

Theoretical Investigation of Composite Neural Network

arXiv:1910.09351v23 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the performance uncertainty of composite neural networks, offering theoretical guarantees for leveraging pre-trained models, though it is incremental in nature.

The paper theoretically investigates composite neural networks, proving that they likely outperform any individual pre-trained component and improve with additional components, supported by empirical evaluations across different applications.

This work theoretically investigates the performance of a composite neural network. A composite neural network is a rooted directed acyclic graph combining a set of pre-trained and non-instantiated neural network models, where a pre-trained neural network model is well-crafted for a specific task and targeted to approximate a specific function with instantiated weights. The advantages of adopting such a pre-trained model in a composite neural network are two folds. One is to benefit from other's intelligence and diligence, and the other is saving the efforts in data preparation and resources and time in training. However, the overall performance of composite neural network is still not clear. In this work, we prove that a composite neural network, with high probability, performs better than any of its pre-trained components under certain assumptions. In addition, if an extra pre-trained component is added to a composite network, with high probability the overall performance will be improved. In the empirical evaluations, distinctively different applications support the above findings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes