NEAIMay 19, 2018

Number Sequence Prediction Problems for Evaluating Computational Powers of Neural Networks

arXiv:1805.07494v215 citations
Originality Incremental advance
AI Analysis

This provides a benchmark for assessing neural networks' algorithmic problem-solving abilities, though it is incremental in evaluating existing models.

The authors introduced number sequence prediction tasks to evaluate neural networks' computational powers, defining complexity via automata and logic structures. Experiments showed CNNs, GRUs, LSTMs, and memory-augmented models could solve some tasks up to pushdown automata complexity but failed on sequences requiring queue automata or Turing machines.

Inspired by number series tests to measure human intelligence, we suggest number sequence prediction tasks to assess neural network models' computational powers for solving algorithmic problems. We define the complexity and difficulty of a number sequence prediction task with the structure of the smallest automaton that can generate the sequence. We suggest two types of number sequence prediction problems: the number-level and the digit-level problems. The number-level problems format sequences as 2-dimensional grids of digits and the digit-level problems provide a single digit input per a time step. The complexity of a number-level sequence prediction can be defined with the depth of an equivalent combinatorial logic, and the complexity of a digit-level sequence prediction can be defined with an equivalent state automaton for the generation rule. Experiments with number-level sequences suggest that CNN models are capable of learning the compound operations of sequence generation rules, but the depths of the compound operations are limited. For the digit-level problems, simple GRU and LSTM models can solve some problems with the complexity of finite state automata. Memory augmented models such as Stack-RNN, Attention, and Neural Turing Machines can solve the reverse-order task which has the complexity of simple pushdown automaton. However, all of above cannot solve general Fibonacci, Arithmetic or Geometric sequence generation problems that represent the complexity of queue automata or Turing machines. The results show that our number sequence prediction problems effectively evaluate machine learning models' computational capabilities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes