LGCHEM-PHFeb 27, 2021

Generative Chemical Transformer: Neural Machine Learning of Molecular Geometric Structures from Chemical Language via Attention

arXiv:2103.00213v262 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of discovering new materials for improving human life quality, representing an incremental advance in applying language models to molecular design.

The authors tackled the problem of inverse molecular design by proposing a Generative Chemical Transformer (GCT) that generates molecules meeting desired conditions, resulting in highly realistic chemical strings that satisfy both chemical and linguistic grammar rules and vary for single condition sets.

Discovering new materials better suited to specific purposes is an important issue in improving the quality of human life. Here, a neural network that creates molecules that meet some desired conditions based on a deep understanding of chemical language is proposed (Generative Chemical Transformer, GCT). The attention mechanism in GCT allows a deeper understanding of molecular structures beyond the limitations of chemical language itself which cause semantic discontinuity by paying attention to characters sparsely. It is investigated that the significance of language models for inverse molecular design problems by quantitatively evaluating the quality of the generated molecules. GCT generates highly realistic chemical strings that satisfy both chemical and linguistic grammar rules. Molecules parsed from generated strings simultaneously satisfy the multiple target properties and vary for a single condition set. These advances will contribute to improving the quality of human life by accelerating the process of desired material discovery.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes