CVFeb 10

DEGMC: Denoising Diffusion Models Based on Riemannian Equivariant Group Morphological Convolutions

El Hadji S. Diop, Thierno Fall, Mohamed Daoudi

arXiv:2602.10221v11.5

Originality Incremental advance

AI Analysis

This addresses geometric limitations in diffusion models for image generation, though it appears incremental as it builds directly on existing DDPM architecture.

The authors tackled two issues in Denoising Diffusion Probabilistic Models (DDPMs) - geometric feature extraction and network equivariance - by introducing Riemannian equivariant group morphological convolutions, resulting in noticeable improvements on MNIST, RotoMNIST, and CIFAR-10 datasets compared to baseline DDPM.

In this work, we address two major issues in recent Denoising Diffusion Probabilistic Models (DDPM): {\bf 1)} geometric key feature extraction and {\bf 2)} network equivariance. Since the DDPM prediction network relies on the U-net architecture, which is theoretically only translation equivariant, we introduce a geometric approach combined with an equivariance property of the more general Euclidean group, which includes rotations, reflections, and permutations. We introduce the notion of group morphological convolutions in Riemannian manifolds, which are derived from the viscosity solutions of first-order Hamilton-Jacobi-type partial differential equations (PDEs) that act as morphological multiscale dilations and erosions. We add a convection term to the model and solve it using the method of characteristics. This helps us better capture nonlinearities, represent thin geometric structures, and incorporate symmetries into the learning process. Experimental results on the MNIST, RotoMNIST, and CIFAR-10 datasets show noticeable improvements compared to the baseline DDPM model.

View on arXiv PDF

Similar