LG NEOct 2, 2014

Deep Sequential Neural Network

arXiv:1410.0510v162 citations

Originality Incremental advance

AI Analysis

This is an incremental improvement for neural network design, offering a more flexible architecture that can adapt to diverse data characteristics.

The paper tackles the problem of limited expressiveness in classical multilayer neural networks by introducing a model where each layer selects from candidate mappings via a sequential decision process, enabling data-specific transformation paths and increasing expression power.

Neural Networks sequentially build high-level features through their successive layers. We propose here a new neural network model where each layer is associated with a set of candidate mappings. When an input is processed, at each layer, one mapping among these candidates is selected according to a sequential decision process. The resulting model is structured according to a DAG like architecture, so that a path from the root to a leaf node defines a sequence of transformations. Instead of considering global transformations, like in classical multilayer networks, this model allows us for learning a set of local transformations. It is thus able to process data with different characteristics through specific sequences of such local transformations, increasing the expression power of this model w.r.t a classical multilayered network. The learning algorithm is inspired from policy gradient techniques coming from the reinforcement learning domain and is used here instead of the classical back-propagation based gradient descent techniques. Experiments on different datasets show the relevance of this approach.

View on arXiv PDF

Similar