LG AI NEApr 26, 2022

Hybridised Loss Functions for Improved Neural Network Generalisation

Matthew C. Dickson, Anna S. Bosman, Katherine M. Malan

arXiv:2204.12244v13.322 citationsh-index: 20

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of enhancing neural network performance for machine learning practitioners, but it is incremental as it builds on known complementary properties of existing loss functions.

The study tackled the problem of improving neural network generalization by investigating hybrid loss functions that combine cross entropy and sum squared error, showing that these hybrids improve generalization on all tested problems, with a specific hybrid (starting with sum squared error and switching to cross entropy) performing best or not significantly different from the best.

Loss functions play an important role in the training of artificial neural networks (ANNs), and can affect the generalisation ability of the ANN model, among other properties. Specifically, it has been shown that the cross entropy and sum squared error loss functions result in different training dynamics, and exhibit different properties that are complementary to one another. It has previously been suggested that a hybrid of the entropy and sum squared error loss functions could combine the advantages of the two functions, while limiting their disadvantages. The effectiveness of such hybrid loss functions is investigated in this study. It is shown that hybridisation of the two loss functions improves the generalisation ability of the ANNs on all problems considered. The hybrid loss function that starts training with the sum squared error loss function and later switches to the cross entropy error loss function is shown to either perform the best on average, or to not be significantly different than the best loss function tested for all problems considered. This study shows that the minima discovered by the sum squared error loss function can be further exploited by switching to cross entropy error loss function. It can thus be concluded that hybridisation of the two loss functions could lead to better performance in ANNs.

View on arXiv PDF

Similar