MLLGOct 18, 2013

Kernel Multivariate Analysis Framework for Supervised Subspace Learning: A Tutorial on Linear and Kernel Multivariate Methods

arXiv:1310.5089v1116 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental tutorial that synthesizes existing methods for researchers and practitioners dealing with high-dimensional data in fields like signal processing and multimodal analysis.

This paper provides a tutorial on linear and kernel multivariate analysis methods for supervised subspace learning, reviewing techniques like PCA, PLS, CCA, and OPLS, and demonstrates their applicability in classification and regression tasks using benchmark datasets and real-world applications such as music genre prediction and hyperspectral image analysis.

Feature extraction and dimensionality reduction are important tasks in many fields of science dealing with signal processing and analysis. The relevance of these techniques is increasing as current sensory devices are developed with ever higher resolution, and problems involving multimodal data sources become more common. A plethora of feature extraction methods are available in the literature collectively grouped under the field of Multivariate Analysis (MVA). This paper provides a uniform treatment of several methods: Principal Component Analysis (PCA), Partial Least Squares (PLS), Canonical Correlation Analysis (CCA) and Orthonormalized PLS (OPLS), as well as their non-linear extensions derived by means of the theory of reproducing kernel Hilbert spaces. We also review their connections to other methods for classification and statistical dependence estimation, and introduce some recent developments to deal with the extreme cases of large-scale and low-sized problems. To illustrate the wide applicability of these methods in both classification and regression problems, we analyze their performance in a benchmark of publicly available data sets, and pay special attention to specific real applications involving audio processing for music genre prediction and hyperspectral satellite images for Earth and climate monitoring.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes