LG AI MLMar 2, 2023

Large Deviations for Accelerating Neural Networks Training

Sreelekha Guggilam, Varun Chandola, Abani Patra

arXiv:2303.00954v12.0h-index: 33

Originality Incremental advance

AI Analysis

This work addresses training inefficiency in neural networks for classification tasks, but it is exploratory and incremental with opportunities for future development.

The authors tackled the problem of inefficient training in artificial neural networks due to redundant data features by proposing LIIT, a novel training approach using large deviations theory to generate representative samples, which reduced training time without significant performance loss.

Artificial neural networks (ANNs) require tremendous amount of data to train on. However, in classification models, most data features are often similar which can lead to increase in training time without significant improvement in the performance. Thus, we hypothesize that there could be a more efficient way to train an ANN using a better representative sample. For this, we propose the LAD Improved Iterative Training (LIIT), a novel training approach for ANN using large deviations principle to generate and iteratively update training samples in a fast and efficient setting. This is exploratory work with extensive opportunities for future work. The thesis presents this ongoing research work with the following contributions from this study: (1) We propose a novel ANN training method, LIIT, based on the large deviations theory where additional dimensionality reduction is not needed to study high dimensional data. (2) The LIIT approach uses a Modified Training Sample (MTS) that is generated and iteratively updated using a LAD anomaly score based sampling strategy. (3) The MTS sample is designed to be well representative of the training data by including most anomalous of the observations in each class. This ensures distinct patterns and features are learnt with smaller samples. (4) We study the classification performance of the LIIT trained ANNs with traditional batch trained counterparts.

View on arXiv PDF

Similar