NECVOct 25, 2018

Structure Learning of Deep Networks via DNA Computing Algorithm

arXiv:1810.10687v1
Originality Incremental advance
AI Analysis

This addresses the need for automated network design in deep learning, offering a novel approach but with incremental gains compared to existing methods.

The paper tackles the problem of automatically designing high-performance convolutional neural network architectures by introducing a DNA computing algorithm, achieving accuracies of 99.73% on MNIST and 95.10% on CIFAR-10.

Convolutional Neural Network (CNN) has gained state-of-the-art results in many pattern recognition and computer vision tasks. However, most of the CNN structures are manually designed by experienced researchers. Therefore, auto- matically building high performance networks becomes an important problem. In this paper, we introduce the idea of using DNA computing algorithm to automatically learn high-performance architectures. In DNA computing algorithm, we use short DNA strands to represent layers and long DNA strands to represent overall networks. We found that most of the learned models perform similarly, and only those performing worse during the first runs of training will perform worse finally than others. The indicates that: 1) Using DNA computing algorithm to learn deep architectures is feasible; 2) Local minima should not be a problem of deep networks; 3) We can use early stop to kill the models with the bad performance just after several runs of training. In our experiments, an accuracy 99.73% was obtained on the MNIST data set and an accuracy 95.10% was obtained on the CIFAR-10 data set.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes