SpecX: A Large-Scale Benchmark for Multi-Modal Spectroscopy and Cross-Paradigm Evaluation

Chengrui Xiang, Tengfei Ma, Yujie Chen, Tong Wang, Haowen Chen, Xiangxiang Zeng

arXiv:2605.1879126.2

Predicted impact top 53% in IV · last 90 daysOriginality Incremental advance

AI Analysis

Provides a unified evaluation framework for spectral intelligence, addressing the lack of large-scale, multi-modal spectral benchmarks for the machine learning and spectroscopy communities.

SpecX introduces a large-scale benchmark with 1.7M molecules and multiple spectral modalities (NMR, IR, MS, UV, Raman, FL) to evaluate both specialized spectral models and multimodal language models, finding that specialized models excel at signal-level tasks while MLLMs lack precise spectral grounding.

Existing spectral benchmarks are limited in scale, modality alignment, and evaluation scope, and typically focus on either specialized models or multimodal language models (MLLMs). We introduce SpecX, a large-scale benchmark for multi-modal spectroscopy with cross-paradigm evaluation. SpecX contains 1.7M molecules with diverse spectral modalities, including NMR (1H, 13C, HSQC), IR, MS,UV,Raman and FL, and is organized into three tiers: a large-scale dataset for pretraining, an aligned multi-spectral subset for benchmarking, and a high-quality experimental subset for evaluation. SpecX supports a range of tasks such as molecular elucidation, spectrum simulation, and spectral understanding, and enables unified evaluation across both specialized spectral models and MLLMs. Experiments show that specialized models excel at signal-level modeling, while MLLMs exhibit strengths in high-level reasoning but lack precise spectral grounding. SpecX establishes a unified benchmark for spectral intelligence and highlights the need for spectrum-native foundation models.

View on arXiv PDF

Similar