LGCVMLJan 31, 2023

Learning Data Representations with Joint Diffusion Models

arXiv:2301.13622v227 citationsh-index: 35
AI Analysis

This work addresses the challenge of building stable and high-performing joint models for data synthesis and classification, which is incremental as it builds upon existing diffusion models.

The paper tackles the problem of uneven performance and instability in joint models for data synthesis and classification by proposing a joint diffusion model with a classifier for stable end-to-end training, which outperforms recent state-of-the-art hybrid methods in both classification and generation quality on all evaluated benchmarks.

Joint machine learning models that allow synthesizing and classifying data often offer uneven performance between those tasks or are unstable to train. In this work, we depart from a set of empirical observations that indicate the usefulness of internal representations built by contemporary deep diffusion-based generative models not only for generating but also predicting. We then propose to extend the vanilla diffusion model with a classifier that allows for stable joint end-to-end training with shared parameterization between those objectives. The resulting joint diffusion model outperforms recent state-of-the-art hybrid methods in terms of both classification and generation quality on all evaluated benchmarks. On top of our joint training approach, we present how we can directly benefit from shared generative and discriminative representations by introducing a method for visual counterfactual explanations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes