LGCVPFJul 16, 2017

Comparative Performance Analysis of Neural Networks Architectures on H2O Platform for Various Activation Functions

arXiv:1707.04940v112 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental study that addresses optimization issues for practitioners in machine learning, particularly in image recognition.

The paper tackled the problem of inefficient parameter selection in neural networks by testing architectures on the H2O platform with various activation functions, showing that blind parameter choices can increase runtime by 2-3 orders of magnitude without significantly improving precision on the MNIST dataset.

Deep learning (deep structured learning, hierarchi- cal learning or deep machine learning) is a branch of machine learning based on a set of algorithms that attempt to model high- level abstractions in data by using multiple processing layers with complex structures or otherwise composed of multiple non-linear transformations. In this paper, we present the results of testing neural networks architectures on H2O platform for various activation functions, stopping metrics, and other parameters of machine learning algorithm. It was demonstrated for the use case of MNIST database of handwritten digits in single-threaded mode that blind selection of these parameters can hugely increase (by 2-3 orders) the runtime without the significant increase of precision. This result can have crucial influence for opitmization of available and new machine learning methods, especially for image recognition problems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes