MLLGMar 14, 2018

Generalised Structural CNNs (SCNNs) for time series data with arbitrary graph topology

arXiv:1803.05419v22 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of generalizing CNNs for time series data with arbitrary graph structures, which is incremental as it extends existing CNN methods to handle non-lattice topologies like small-world networks or trees.

The authors tackled the problem of applying convolutional neural networks to time series data with non-regular graph topologies, such as human kinematics, by developing a framework for graph-structured CNNs that automatically builds convolutional layers based on adjacency matrices and hop distance scaling, and demonstrated that including graph structure significantly improves model predictions compared to a benchmark CNN with only time convolution layers.

Deep Learning methods, specifically convolutional neural networks (CNNs), have seen a lot of success in the domain of image-based data, where the data offers a clearly structured topology in the regular lattice of pixels. This 4-neighbourhood topological simplicity makes the application of convolutional masks straightforward for time series data, such as video applications, but many high-dimensional time series data are not organised in regular lattices, and instead values may have adjacency relationships with non-trivial topologies, such as small-world networks or trees. In our application case, human kinematics, it is currently unclear how to generalise convolutional kernels in a principled manner. Therefore we define and implement here a framework for general graph-structured CNNs for time series analysis. Our algorithm automatically builds convolutional layers using the specified adjacency matrix of the data dimensions and convolutional masks that scale with the hop distance. In the limit of a lattice-topology our method produces the well-known image convolutional masks. We test our method first on synthetic data of arbitrarily-connected graphs and human hand motion capture data, where the hand is represented by a tree capturing the mechanical dependencies of the joints. We are able to demonstrate, amongst other things, that inclusion of the graph structure of the data dimensions improves model prediction significantly, when compared against a benchmark CNN model with only time convolution layers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes