CVOct 13, 2017

Dynamic texture recognition using time-causal and time-recursive spatio-temporal receptive fields

arXiv:1710.04842v33 citations
Originality Incremental advance
AI Analysis

This is an incremental improvement for video analysis, specifically in dynamic texture recognition, by extending existing spatial methods to the spatio-temporal domain.

This work tackles dynamic texture recognition by proposing video descriptors based on time-causal spatio-temporal receptive fields, achieving competitive performance with state-of-the-art methods and showing that binary versions of these descriptors outperform many similar approaches using handcrafted or learned primitives.

This work presents a first evaluation of using spatio-temporal receptive fields from a recently proposed time-causal spatio-temporal scale-space framework as primitives for video analysis. We propose a new family of video descriptors based on regional statistics of spatio-temporal receptive field responses and evaluate this approach on the problem of dynamic texture recognition. Our approach generalises a previously used method, based on joint histograms of receptive field responses, from the spatial to the spatio-temporal domain and from object recognition to dynamic texture recognition. The time-recursive formulation enables computationally efficient time-causal recognition. The experimental evaluation demonstrates competitive performance compared to state-of-the-art. Especially, it is shown that binary versions of our dynamic texture descriptors achieve improved performance compared to a large range of similar methods using different primitives either handcrafted or learned from data. Further, our qualitative and quantitative investigation into parameter choices and the use of different sets of receptive fields highlights the robustness and flexibility of our approach. Together, these results support the descriptive power of this family of time-causal spatio-temporal receptive fields, validate our approach for dynamic texture recognition and point towards the possibility of designing a range of video analysis methods based on these new time-causal spatio-temporal primitives.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes