QMAIIRJun 13, 2024

Bioptic B1: A Target-Agnostic Potency-Based Small Molecules Search Engine

arXiv:2406.14572v4
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in virtual screening for drug discovery, enabling faster and more effective searches for novel molecules.

The researchers tackled the challenge of efficiently screening ultra-large chemical libraries with large models by developing a target-agnostic, potency-based molecule search engine, achieving 100% recall rate on a 40-billion-molecule library.

Recent successes in virtual screening have been made possible by large models and extensive chemical libraries. However, combining these elements is challenging: the larger the model, the more expensive it is to run, making ultra-large libraries unfeasible. To address this, we developed a target-agnostic, efficacy-based molecule search model, which allows us to find structurally dissimilar molecules with similar biological activities. We used the best practices to design fast retrieval system, based on processor-optimized SIMD instructions, enabling us to screen the ultra-large 40B Enamine REAL library with 100\% recall rate. We extensively benchmarked our model and several state-of-the-art models for both speed performance and retrieval quality of novel molecules.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes