IMCVMay 30, 2025

Applying Vision Transformers on Spectral Analysis of Astronomical Objects

arXiv:2506.00294v12 citationsh-index: 36Has Code
Originality Incremental advance
AI Analysis

This work addresses spectroscopic data analysis for astronomers, but it is incremental as it adapts an existing method to a new domain.

The paper tackled analyzing astronomical spectral data by applying pre-trained Vision Transformers to one-dimensional spectra converted into images, achieving higher classification accuracy than traditional methods and competitive redshift estimation performance.

We apply pre-trained Vision Transformers (ViTs), originally developed for image recognition, to the analysis of astronomical spectral data. By converting traditional one-dimensional spectra into two-dimensional image representations, we enable ViTs to capture both local and global spectral features through spatial self-attention. We fine-tune a ViT pretrained on ImageNet using millions of spectra from the SDSS and LAMOST surveys, represented as spectral plots. Our model is evaluated on key tasks including stellar object classification and redshift ($z$) estimation, where it demonstrates strong performance and scalability. We achieve classification accuracy higher than Support Vector Machines and Random Forests, and attain $R^2$ values comparable to AstroCLIP's spectrum encoder, even when generalizing across diverse object types. These results demonstrate the effectiveness of using pretrained vision models for spectroscopic data analysis. To our knowledge, this is the first application of ViTs to large-scale, which also leverages real spectroscopic data and does not rely on synthetic inputs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes