AI QMMay 6

Curated AI beats frontier LLMs at pharma asset discovery

arXiv:2605.0490826.2

AI Analysis

For pharmaceutical researchers, this demonstrates that curated domain-specific AI can dramatically outperform general-purpose LLMs for drug asset discovery.

Gosset, a curated AI platform, returns 3.2x more verified drugs per query than frontier LLMs with web search on niche oncology/immunology targets, achieving perfect precision and 100% recall.

General-purpose LLMs with web search are increasingly used to scout the competitive landscape of pharmaceutical pipelines. We benchmark Gosset -- an AI platform with a chat interface backed by curated target-, modality-, and indication-level drug-asset annotations -- against four frontier systems with web access (Claude Opus 4.7, GPT 5.5, Gemini 3.1 Pro, Perplexity sonar-pro) on ten niche oncology/immunology targets where most of the pipeline lives in the long tail of preclinical and Asian-developed assets. All five systems receive the same natural-language query and the same JSON output schema. Across 10 targets Gosset returns 3.2x more verified drugs per query than the best frontier system, at perfect precision and 100% recall against the cross-system union of verified drugs. The same curated index is exposed as a Gosset MCP server that any frontier model can call as a tool, suggesting that each of these systems can close most of the recall gap by swapping generic web search for a curated index behind the same chat interface.

View on arXiv PDF

Similar