LGQMMLApr 30, 2024

Leveraging Active Subspaces to Capture Epistemic Model Uncertainty in Deep Generative Models for Molecular Design

arXiv:2405.00202v22 citationsh-index: 7MLSP
Originality Synthesis-oriented
AI Analysis

This work addresses uncertainty quantification for researchers in computational chemistry and drug design, offering an incremental improvement by applying an existing UQ technique to a specific generative model.

The paper tackled the challenge of quantifying epistemic uncertainty in deep generative models for molecular design, specifically the JT-VAE, by using active subspaces to approximate posterior distributions, resulting in a method that captures model diversity without altering architectures.

Deep generative models have been accelerating the inverse design process in material and drug design. Unlike their counterpart property predictors in typical molecular design frameworks, generative molecular design models have seen fewer efforts on uncertainty quantification (UQ) due to computational challenges in Bayesian inference posed by their large number of parameters. In this work, we focus on the junction-tree variational autoencoder (JT-VAE), a popular model for generative molecular design, and address this issue by leveraging the low dimensional active subspace to capture the uncertainty in the model parameters. Specifically, we approximate the posterior distribution over the active subspace parameters to estimate the epistemic model uncertainty in an extremely high dimensional parameter space. The proposed UQ scheme does not require alteration of the model architecture, making it readily applicable to any pre-trained model. Our experiments demonstrate the efficacy of the AS-based UQ and its potential impact on molecular optimization by exploring the model diversity under epistemic uncertainty.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes