LGCVMay 31, 2016

Dynamic Filter Networks

arXiv:1605.09673v21089 citations
AI Analysis

This addresses the problem of adaptive feature extraction in neural networks for tasks like video and stereo prediction, offering a novel framework with potential for unsupervised pretraining.

The paper tackles the limitation of fixed filters in traditional convolutional layers by introducing Dynamic Filter Networks, which generate filters conditioned on input, achieving state-of-the-art performance on the moving MNIST dataset with a smaller model.

In a traditional convolutional layer, the learned filters stay fixed after training. In contrast, we introduce a new framework, the Dynamic Filter Network, where filters are generated dynamically conditioned on an input. We show that this architecture is a powerful one, with increased flexibility thanks to its adaptive nature, yet without an excessive increase in the number of model parameters. A wide variety of filtering operations can be learned this way, including local spatial transformations, but also others like selective (de)blurring or adaptive feature extraction. Moreover, multiple such layers can be combined, e.g. in a recurrent architecture. We demonstrate the effectiveness of the dynamic filter network on the tasks of video and stereo prediction, and reach state-of-the-art performance on the moving MNIST dataset with a much smaller model. By visualizing the learned filters, we illustrate that the network has picked up flow information by only looking at unlabelled training data. This suggests that the network can be used to pretrain networks for various supervised tasks in an unsupervised way, like optical flow and depth estimation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes