LGAIBMOct 15, 2023

Generative artificial intelligence for de novo protein design

arXiv:2310.09685v163 citationsh-index: 9
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of designing new proteins with specific functions for applications in biotechnology and medicine, but it is a review paper, so it is incremental in summarizing existing progress.

The paper reviews how generative AI models, such as language models and diffusion processes, are advancing de novo protein design by generating novel proteins with desirable properties, achieving experimental success rates of nearly 20%.

Engineering new molecules with desirable functions and properties has the potential to extend our ability to engineer proteins beyond what nature has so far evolved. Advances in the so-called "de novo" design problem have recently been brought forward by developments in artificial intelligence. Generative architectures, such as language models and diffusion processes, seem adept at generating novel, yet realistic proteins that display desirable properties and perform specified functions. State-of-the-art design protocols now achieve experimental success rates nearing 20%, thus widening the access to de novo designed proteins. Despite extensive progress, there are clear field-wide challenges, for example in determining the best in silico metrics to prioritise designs for experimental testing, and in designing proteins that can undergo large conformational changes or be regulated by post-translational modifications and other cellular processes. With an increase in the number of models being developed, this review provides a framework to understand how these tools fit into the overall process of de novo protein design. Throughout, we highlight the power of incorporating biochemical knowledge to improve performance and interpretability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes