AIJan 13, 2025

Kriging and Gaussian Process Interpolation for Georeferenced Data Augmentation

arXiv:2501.07183v12 citationsh-index: 27
Originality Synthesis-oriented
AI Analysis

This work addresses data scarcity in agricultural monitoring for researchers and practitioners, but it is incremental as it applies existing interpolation techniques to a specific domain.

This study tackled the problem of augmenting limited geo-referenced data for predicting Commelina benghalensis L. in sugarcane plots, finding that Gaussian process-based methods, especially with combined kernels, significantly improved regression algorithm performance while requiring less additional data.

Data augmentation is a crucial step in the development of robust supervised learning models, especially when dealing with limited datasets. This study explores interpolation techniques for the augmentation of geo-referenced data, with the aim of predicting the presence of Commelina benghalensis L. in sugarcane plots in La R{é}union. Given the spatial nature of the data and the high cost of data collection, we evaluated two interpolation approaches: Gaussian processes (GPs) with different kernels and kriging with various variograms. The objectives of this work are threefold: (i) to identify which interpolation methods offer the best predictive performance for various regression algorithms, (ii) to analyze the evolution of performance as a function of the number of observations added, and (iii) to assess the spatial consistency of augmented datasets. The results show that GP-based methods, in particular with combined kernels (GP-COMB), significantly improve the performance of regression algorithms while requiring less additional data. Although kriging shows slightly lower performance, it is distinguished by a more homogeneous spatial coverage, a potential advantage in certain contexts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes