LGFeb 6, 2025

Regularization via f-Divergence: An Application to Multi-Oxide Spectroscopic Analysis

Weizhi Li, Natalie Klein, Brendan Gifford, Elizabeth Sklute, Carey Legett, Samuel Clegg

arXiv:2502.03755v19.42 citationsh-index: 2Has Code

Originality Incremental advance

AI Analysis

This addresses the task of chemical composition analysis for planetary science, offering an incremental improvement in regularization for multi-target regression.

The paper tackles the problem of predicting multi-oxide weights from spectroscopic data under Martian conditions using CNNs, proposing an f-divergence regularization method that performs better than or comparable to standard methods like L1, L2, and dropout, with combined use further enhancing performance.

In this paper, we address the task of characterizing the chemical composition of planetary surfaces using convolutional neural networks (CNNs). Specifically, we seek to predict the multi-oxide weights of rock samples based on spectroscopic data collected under Martian conditions. We frame this problem as a multi-target regression task and propose a novel regularization method based on f-divergence. The f-divergence regularization is designed to constrain the distributional discrepancy between predictions and noisy targets. This regularizer serves a dual purpose: on the one hand, it mitigates overfitting by enforcing a constraint on the distributional difference between predictions and noisy targets. On the other hand, it acts as an auxiliary loss function, penalizing the neural network when the divergence between the predicted and target distributions becomes too large. To enable backpropagation during neural network training, we develop a differentiable f-divergence and incorporate it into the f-divergence regularization, making the network training feasible. We conduct experiments using spectra collected in a Mars-like environment by the remote-sensing instruments aboard the Curiosity and Perseverance rovers. Experimental results on multi-oxide weight prediction demonstrate that the proposed $f$-divergence regularization performs better than or comparable to standard regularization methods including $L_1$, $L_2$, and dropout. Notably, combining the $f$-divergence regularization with these standard regularization further enhances performance, outperforming each regularization method used independently.

View on arXiv PDF Code

Similar