Sequential training algorithm for neural networks
This method addresses computational inefficiency for researchers and practitioners working with large neural networks, but it is incremental as it builds on existing training algorithms with minimal modifications.
The paper tackles the computational challenge of training large-scale feedforward neural networks by proposing a sequential training method that decouples and trains each layer separately, reducing resource usage and improving convergence, though it results in sub-optimal performance compared to full network training.
A sequential training method for large-scale feedforward neural networks is presented. Each layer of the neural network is decoupled and trained separately. After the training is completed for each layer, they are combined together. The performance of the network would be sub-optimal compared to the full network training if the optimal solution would be achieved. However, achieving the optimal solution for the full network would be infeasible or require long computing time. The proposed sequential approach reduces the required computer resources significantly and would have better convergences as a single layer is optimised for each optimisation step. The required modifications of existing algorithms to implement the sequential training are minimal. The performance is verified by a simple example.