CVNov 12, 2021

Frequency learning for structured CNN filters with Gaussian fractional derivatives

Nikhil Saldanha, Silvia L. Pintea, Jan C. van Gemert, Nergis Tomen

arXiv:2111.06660v18.011 citations

Originality Synthesis-oriented

AI Analysis

This work addresses a domain-specific problem in computer vision for improving texture discrimination in CNNs, representing an incremental advancement over existing Gaussian derivative methods.

The paper tackles the problem of limited frequency learning in CNNs due to fixed filter sizes by learning the order of Gaussian derivative basis functions, enabling adaptation to optimal frequencies for the task. This approach achieves parameter savings and data efficiency compared to standard CNNs and Gaussian derivative CNN filter networks.

Frequency information lies at the base of discriminating between textures, and therefore between different objects. Classical CNN architectures limit the frequency learning through fixed filter sizes, and lack a way of explicitly controlling it. Here, we build on the structured receptive field filters with Gaussian derivative basis. Yet, rather than using predetermined derivative orders, which typically result in fixed frequency responses for the basis functions, we learn these. We show that by learning the order of the basis we can accurately learn the frequency of the filters, and hence adapt to the optimal frequencies for the underlying learning task. We investigate the well-founded mathematical formulation of fractional derivatives to adapt the filter frequencies during training. Our formulation leads to parameter savings and data efficiency when compared to the standard CNNs and the Gaussian derivative CNN filter networks that we build upon.

View on arXiv PDF

Similar