SPCVIVDec 7, 2020

Active Learning Methods for Efficient Hybrid Biophysical Variable Retrieval

arXiv:2012.04468v10.0091 citations
AI Analysis50

This work provides a method for efficiently training MLRAs for operational biophysical variable retrieval, which is important for remote sensing applications dealing with large datasets.

This paper addresses the challenge of large training datasets in kernel-based machine learning regression algorithms (MLRAs) for biophysical variable retrieval. It introduces six active learning (AL) methods that improve retrieval accuracy for leaf area index and chlorophyll content using PROSAIL simulations, outperforming random sampling at lower sampling rates.

Kernel-based machine learning regression algorithms (MLRAs) are potentially powerful methods for being implemented into operational biophysical variable retrieval schemes. However, they face difficulties in coping with large training datasets. With the increasing amount of optical remote sensing data made available for analysis and the possibility of using a large amount of simulated data from radiative transfer models (RTMs) to train kernel MLRAs, efficient data reduction techniques will need to be implemented. Active learning (AL) methods enable to select the most informative samples in a dataset. This letter introduces six AL methods for achieving optimized biophysical variable estimation with a manageable training dataset, and their implementation into a Matlab-based MLRA toolbox for semi-automatic use. The AL methods were analyzed on their efficiency of improving the estimation accuracy of leaf area index and chlorophyll content based on PROSAIL simulations. Each of the implemented methods outperformed random sampling, improving retrieval accuracy with lower sampling rates. Practically, AL methods open opportunities to feed advanced MLRAs with RTM-generated training data for development of operational retrieval models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes