CVAug 22, 2017

A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition

arXiv:1708.06690v130 citations
Originality Incremental advance
AI Analysis

This work addresses dynamic texture recognition for computer vision applications, offering a novel, interpretable method with incremental improvements in performance.

The paper tackled dynamic texture recognition by introducing a hierarchical spatiotemporal orientation network that combines ConvNet-like architecture with analytical design, achieving new state-of-the-art results on multiple standard datasets.

This paper presents a novel hierarchical spatiotemporal orientation representation for spacetime image analysis. It is designed to combine the benefits of the multilayer architecture of ConvNets and a more controlled approach to spacetime analysis. A distinguishing aspect of the approach is that unlike most contemporary convolutional networks no learning is involved; rather, all design decisions are specified analytically with theoretical motivations. This approach makes it possible to understand what information is being extracted at each stage and layer of processing as well as to minimize heuristic choices in design. Another key aspect of the network is its recurrent nature, whereby the output of each layer of processing feeds back to the input. To keep the network size manageable across layers, a novel cross-channel feature pooling is proposed. The multilayer architecture that results systematically reveals hierarchical image structure in terms of multiscale, multiorientation properties of visual spacetime. To illustrate its utility, the network has been applied to the task of dynamic texture recognition. Empirical evaluation on multiple standard datasets shows that it sets a new state-of-the-art.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes