CVMar 2, 2020

MVC-Net: A Convolutional Neural Network Architecture for Manifold-Valued Images With Applications

arXiv:2003.01234v26 citations
AI Analysis

This addresses a problem for researchers and practitioners in medical imaging and computer vision who work with exotic data types like manifold-valued images, representing a novel method for a known bottleneck.

The paper tackles the problem of generalizing convolutional neural networks to handle manifold-valued images, where each pixel is a sample from a Riemannian manifold, by introducing the Manifold-Valued Convolution (MVC) operation and MVC-net architecture. It demonstrates superior performance in medical imaging and computer vision tasks, though no concrete numbers are provided in the abstract.

Geometric deep learning has attracted significant attention in recent years, in part due to the availability of exotic data types for which traditional neural network architectures are not well suited. Our goal in this paper is to generalize convolutional neural networks (CNN) to the manifold-valued image case which arises commonly in medical imaging and computer vision applications. Explicitly, the input data to the network is an image where each pixel value is a sample from a Riemannian manifold. To achieve this goal, we must generalize the basic building block of traditional CNN architectures, namely, the weighted combinations operation. To this end, we develop a tangent space combination operation which is used to define a convolution operation on manifold-valued images that we call, the Manifold-Valued Convolution (MVC). We prove theoretical properties of the MVC operation, including equivariance to the action of the isometry group admitted by the manifold and characterizing when compositions of MVC layers collapse to a single layer. We present a detailed description of how to use MVC layers to build full, multi-layer neural networks that operate on manifold-valued images, which we call the MVC-net. Further, we empirically demonstrate superior performance of the MVC-nets in medical imaging and computer vision tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes