CVFeb 2, 2021

Orientation Convolutional Networks for Image Recognition

Yalan Qin, Guorui Feng, Hanzhou Wu, Yanli Ren, Xinpeng Zhang

arXiv:2102.01523v11.4

Originality Incremental advance

AI Analysis

This work offers an incremental improvement in image recognition for tasks requiring robustness to object orientation changes, particularly for practitioners seeking more efficient and accurate models.

This paper addresses the limitation of Deep Convolutional Neural Networks (DCNNs) in modeling orientation transformations by introducing Orientation Convolution Networks (OCNs) based on Landmark Gabor Filters (LGFs). The proposed OCNs achieve higher performance and reduced sensitivity to orientation changes on several benchmarks, while also having fewer parameters and lower training complexity compared to existing state-of-the-art methods.

Deep Convolutional Neural Networks (DCNNs) are capable of obtaining powerful image representations, which have attracted great attentions in image recognition. However, they are limited in modeling orientation transformation by the internal mechanism. In this paper, we develop Orientation Convolution Networks (OCNs) for image recognition based on the proposed Landmark Gabor Filters (LGFs) that the robustness of the learned representation against changed of orientation can be enhanced. By modulating the convolutional filter with LGFs, OCNs can be compatible with any existing deep learning networks. LGFs act as a Gabor filter bank achieved by selecting $ p $ $ \left( \ll n\right) $ representative Gabor filters as andmarks and express the original Gabor filters as sparse linear combinations of these landmarks. Specifically, based on a matrix factorization framework, a flexible integration for the local and the global structure of original Gabor filters by sparsity and low-rank constraints is utilized. With the propogation of the low-rank structure, the corresponding sparsity for representation of original Gabor filter bank can be significantly promoted. Experimental results over several benchmarks demonstrate that our method is less sensitive to the orientation and produce higher performance both in accuracy and cost, compared with the existing state-of-art methods. Besides, our OCNs have few parameters to learn and can significantly reduce the complexity of training network.

View on arXiv PDF

Similar