CVOct 15, 2024

Feature-guided score diffusion for sampling conditional densities

arXiv:2410.11646v13 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in conditional density sampling for researchers in generative modeling, offering an incremental improvement over existing score diffusion variants.

The paper tackles the challenge of correctly estimating scores for conditional densities in score diffusion methods by introducing a feature-guided algorithm that uses a projected score to push image feature vectors towards class centroids, resulting in high-quality and diverse conditional sample generation with demonstrated out-of-distribution generalization.

Score diffusion methods can learn probability densities from samples. The score of the noise-corrupted density is estimated using a deep neural network, which is then used to iteratively transport a Gaussian white noise density to a target density. Variants for conditional densities have been developed, but correct estimation of the corresponding scores is difficult. We avoid these difficulties by introducing an algorithm that guides the diffusion with a projected score. The projection pushes the image feature vector towards the feature vector centroid of the target class. The projected score and the feature vectors are learned by the same network. Specifically, the image feature vector is defined as the spatial averages of the channels activations in select layers of the network. Optimizing the projected score for denoising loss encourages image feature vectors of each class to cluster around their centroids. It also leads to the separations of the centroids. We show that these centroids provide a low-dimensional Euclidean embedding of the class conditional densities. We demonstrate that the algorithm can generate high quality and diverse samples from the conditioning class. Conditional generation can be performed using feature vectors interpolated between those of the training set, demonstrating out-of-distribution generalization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes