CV LG IVAug 28, 2023

Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance

Mackenzie J. Meni, Ryan T. White, Michael Mayo, Kevin Pilkiewicz

arXiv:2308.14938v25.013 citationsh-index: 7

Originality Incremental advance

AI Analysis

This work addresses efficiency and interpretability issues in neural networks for image processing applications, representing an incremental improvement through novel loss functions.

The authors tackled the problems of high computational costs and lack of interpretability in neural networks by developing entropy-based loss terms that measure changes in entropy during data processing, resulting in models that converge in fewer training epochs and achieve higher accuracy on image tasks.

Neural networks have dramatically increased our capacity to learn from large, high-dimensional datasets across innumerable disciplines. However, their decisions are not easily interpretable, their computational costs are high, and building and training them are not straightforward processes. To add structure to these efforts, we derive new mathematical results to efficiently measure the changes in entropy as fully-connected and convolutional neural networks process data. By measuring the change in entropy as networks process data effectively, patterns critical to a well-performing network can be visualized and identified. Entropy-based loss terms are developed to improve dense and convolutional model accuracy and efficiency by promoting the ideal entropy patterns. Experiments in image compression, image classification, and image segmentation on benchmark datasets demonstrate these losses guide neural networks to learn rich latent data representations in fewer dimensions, converge in fewer training epochs, and achieve higher accuracy.

View on arXiv PDF

Similar