CLMLNov 16, 2022

Galactica: A Large Language Model for Science

arXiv:2211.09085v11031 citationsh-index: 22Has Code
Originality Highly original
AI Analysis

This addresses the challenge of accessing and organizing scientific knowledge for researchers, offering a new interface for science with broad potential impact.

The paper tackles the problem of information overload in science by introducing Galactica, a large language model trained on scientific corpora, which outperforms existing models on various scientific tasks, such as achieving 68.2% on LaTeX equations compared to GPT-3's 49.0% and setting new state-of-the-art scores like 77.6% on PubMedQA.

Information overload is a major obstacle to scientific progress. The explosive growth in scientific literature and data has made it ever harder to discover useful insights in a large mass of information. Today scientific knowledge is accessed through search engines, but they are unable to organize scientific knowledge alone. In this paper we introduce Galactica: a large language model that can store, combine and reason about scientific knowledge. We train on a large scientific corpus of papers, reference material, knowledge bases and many other sources. We outperform existing models on a range of scientific tasks. On technical knowledge probes such as LaTeX equations, Galactica outperforms the latest GPT-3 by 68.2% versus 49.0%. Galactica also performs well on reasoning, outperforming Chinchilla on mathematical MMLU by 41.3% to 35.7%, and PaLM 540B on MATH with a score of 20.4% versus 8.8%. It also sets a new state-of-the-art on downstream tasks such as PubMedQA and MedMCQA dev of 77.6% and 52.9%. And despite not being trained on a general corpus, Galactica outperforms BLOOM and OPT-175B on BIG-bench. We believe these results demonstrate the potential for language models as a new interface for science. We open source the model for the benefit of the scientific community.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes