GNLGNEApr 5, 2018

Feedback GAN (FBGAN) for DNA: a Novel Feedback-Loop Architecture for Optimizing Protein Functions

arXiv:1804.01694v162 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of optimizing protein functions in synthetic biology, offering a domain-specific but potentially extensible method for generating useful biological sequences.

The authors tackled the problem of generating synthetic DNA sequences for proteins with desired properties by proposing a novel feedback-loop architecture called FBGAN, which optimizes sequences using an external function analyzer without requiring differentiability, and demonstrated its effectiveness in generating antimicrobial peptides and optimizing secondary structures with desirable biophysical properties.

Generative Adversarial Networks (GANs) represent an attractive and novel approach to generate realistic data, such as genes, proteins, or drugs, in synthetic biology. Here, we apply GANs to generate synthetic DNA sequences encoding for proteins of variable length. We propose a novel feedback-loop architecture, called Feedback GAN (FBGAN), to optimize the synthetic gene sequences for desired properties using an external function analyzer. The proposed architecture also has the advantage that the analyzer need not be differentiable. We apply the feedback-loop mechanism to two examples: 1) generating synthetic genes coding for antimicrobial peptides, and 2) optimizing synthetic genes for the secondary structure of their resulting peptides. A suite of metrics demonstrate that the GAN generated proteins have desirable biophysical properties. The FBGAN architecture can also be used to optimize GAN-generated datapoints for useful properties in domains beyond genomics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes