MLLGCHEM-PHJun 17, 2020

MoFlow: An Invertible Flow Model for Generating Molecular Graphs

arXiv:2006.10137v1371 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of accelerating drug discovery by generating novel, valid molecular structures, though it builds incrementally on existing flow-based methods.

The authors tackled the problem of generating chemically valid molecular graphs with desired properties by proposing MoFlow, an invertible flow-based model that achieves 100% reconstruction of training data and state-of-the-art performance on tasks like property optimization.

Generating molecular graphs with desired chemical properties driven by deep graph generative models provides a very promising way to accelerate drug discovery process. Such graph generative models usually consist of two steps: learning latent representations and generation of molecular graphs. However, to generate novel and chemically-valid molecular graphs from latent representations is very challenging because of the chemical constraints and combinatorial complexity of molecular graphs. In this paper, we propose MoFlow, a flow-based graph generative model to learn invertible mappings between molecular graphs and their latent representations. To generate molecular graphs, our MoFlow first generates bonds (edges) through a Glow based model, then generates atoms (nodes) given bonds by a novel graph conditional flow, and finally assembles them into a chemically valid molecular graph with a posthoc validity correction. Our MoFlow has merits including exact and tractable likelihood training, efficient one-pass embedding and generation, chemical validity guarantees, 100\% reconstruction of training data, and good generalization ability. We validate our model by four tasks: molecular graph generation and reconstruction, visualization of the continuous latent space, property optimization, and constrained property optimization. Our MoFlow achieves state-of-the-art performance, which implies its potential efficiency and effectiveness to explore large chemical space for drug discovery.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes