LGAICEQMAug 24, 2024

ReactZyme: A Benchmark for Enzyme-Reaction Prediction

arXiv:2408.13659v322 citationsh-index: 14Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of enzyme function annotation for biologists and researchers in drug development and biotechnology, though it appears incremental as it builds on existing datasets and retrieval-based approaches.

The paper tackles enzyme function prediction by introducing a new benchmark and method for predicting enzyme-catalyzed reactions, using machine learning on a large dataset to rank enzymes for specific reactions, achieving adaptability to novel reactions and proteins.

Enzymes, with their specific catalyzed reactions, are necessary for all aspects of life, enabling diverse biological processes and adaptations. Predicting enzyme functions is essential for understanding biological pathways, guiding drug development, enhancing bioproduct yields, and facilitating evolutionary studies. Addressing the inherent complexities, we introduce a new approach to annotating enzymes based on their catalyzed reactions. This method provides detailed insights into specific reactions and is adaptable to newly discovered reactions, diverging from traditional classifications by protein family or expert-derived reaction classes. We employ machine learning algorithms to analyze enzyme reaction datasets, delivering a much more refined view on the functionality of enzymes. Our evaluation leverages the largest enzyme-reaction dataset to date, derived from the SwissProt and Rhea databases with entries up to January 8, 2024. We frame the enzyme-reaction prediction as a retrieval problem, aiming to rank enzymes by their catalytic ability for specific reactions. With our model, we can recruit proteins for novel reactions and predict reactions in novel proteins, facilitating enzyme discovery and function annotation (https://github.com/WillHua127/ReactZyme).

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes