CVNov 12, 2021

Frequency learning for structured CNN filters with Gaussian fractional derivatives

arXiv:2111.06660v111 citations
Originality Synthesis-oriented
AI Analysis

This work addresses a domain-specific problem in computer vision for improving texture discrimination in CNNs, representing an incremental advancement over existing Gaussian derivative methods.

The paper tackles the problem of limited frequency learning in CNNs due to fixed filter sizes by learning the order of Gaussian derivative basis functions, enabling adaptation to optimal frequencies for the task. This approach achieves parameter savings and data efficiency compared to standard CNNs and Gaussian derivative CNN filter networks.

Frequency information lies at the base of discriminating between textures, and therefore between different objects. Classical CNN architectures limit the frequency learning through fixed filter sizes, and lack a way of explicitly controlling it. Here, we build on the structured receptive field filters with Gaussian derivative basis. Yet, rather than using predetermined derivative orders, which typically result in fixed frequency responses for the basis functions, we learn these. We show that by learning the order of the basis we can accurately learn the frequency of the filters, and hence adapt to the optimal frequencies for the underlying learning task. We investigate the well-founded mathematical formulation of fractional derivatives to adapt the filter frequencies during training. Our formulation leads to parameter savings and data efficiency when compared to the standard CNNs and the Gaussian derivative CNN filter networks that we build upon.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes