CVFeb 26, 2024

Efficient 3D affinely equivariant CNNs with adaptive fusion of augmented spherical Fourier-Bessel bases

arXiv:2402.16825v41 citationsh-index: 7Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of inefficient and inflexible 3D group equivariant CNNs for volumetric medical image segmentation, representing an incremental improvement over existing methods.

The paper tackled the underperformance of filter-decomposition-based group equivariant CNNs in 3D medical image segmentation by proposing an efficient non-parameter-sharing continuous 3D affine group equivariant neural network, which achieved better affine group equivariance and superior segmentation accuracy on four datasets, significantly improving training stability and data efficiency.

Filter-decomposition-based group equivariant convolutional neural networks (CNNs) have shown promising stability and data efficiency for 3D image feature extraction. However, these networks, which rely on parameter sharing and discrete transformation groups, often underperform in modern deep neural network architectures for processing volumetric images, such as the common 3D medical images. To address these limitations, this paper presents an efficient non-parameter-sharing continuous 3D affine group equivariant neural network for volumetric images. This network uses an adaptive aggregation of Monte Carlo augmented spherical Fourier-Bessel filter bases to improve the efficiency and flexibility of 3D group equivariant CNNs for volumetric data. Unlike existing methods that focus only on angular orthogonality in filter bases, the introduced spherical Bessel Fourier filter base incorporates both angular and radial orthogonality to improve feature extraction. Experiments on four medical image segmentation datasets show that the proposed methods achieve better affine group equivariance and superior segmentation accuracy than existing 3D group equivariant convolutional neural network layers, significantly improving the training stability and data efficiency of conventional CNN layers (at 0.05 significance level). The code is available at https://github.com/ZhaoWenzhao/WMCSFB.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes