MLLGMay 24, 2022

EBM Life Cycle: MCMC Strategies for Synthesis, Defense, and Density Modeling

arXiv:2205.12243v11 citationsh-index: 91Has Code
Originality Highly original
AI Analysis

It addresses the challenge of flexible EBM training for computer vision tasks, offering incremental improvements through novel MCMC initialization methods.

The paper tackles the problem of learning Energy-Based Models (EBMs) by adapting MCMC sampling trajectory lengths for different applications, achieving state-of-the-art FID scores on CIFAR-10 and ImageNet for image generation, top adversarial defense on CIFAR-10, and the first EBM defense on ImageNet.

This work presents strategies to learn an Energy-Based Model (EBM) according to the desired length of its MCMC sampling trajectories. MCMC trajectories of different lengths correspond to models with different purposes. Our experiments cover three different trajectory magnitudes and learning outcomes: 1) shortrun sampling for image generation; 2) midrun sampling for classifier-agnostic adversarial defense; and 3) longrun sampling for principled modeling of image probability densities. To achieve these outcomes, we introduce three novel methods of MCMC initialization for negative samples used in Maximum Likelihood (ML) learning. With standard network architectures and an unaltered ML objective, our MCMC initialization methods alone enable significant performance gains across the three applications that we investigate. Our results include state-of-the-art FID scores for unnormalized image densities on the CIFAR-10 and ImageNet datasets; state-of-the-art adversarial defense on CIFAR-10 among purification methods and the first EBM defense on ImageNet; and scalable techniques for learning valid probability densities. Code for this project can be found at https://github.com/point0bar1/ebm-life-cycle.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes