MLLGNANov 15, 2018

The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description

arXiv:1811.06838v31 citations
Originality Incremental advance
AI Analysis

This addresses the bandwidth selection issue in SVDD and one-class SVMs for anomaly detection, offering an incremental improvement over existing methods.

The paper tackles the problem of selecting the Gaussian kernel bandwidth for Support Vector Data Description (SVDD) in anomaly detection, presenting a new unsupervised method based on a low-rank representation of the kernel matrix, which is competitive with state-of-the-art for low-dimensional data and performs extremely well for high-dimensional data.

Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choice. The Gaussian kernel has a bandwidth parameter, and it is important to set the value of this parameter correctly for good results. A small bandwidth leads to overfitting such that the resulting SVDD classifier overestimates the number of anomalies, whereas a large bandwidth leads to underfitting and an inability to detect many anomalies. In this paper, we present a new unsupervised method for selecting the Gaussian kernel bandwidth. Our method exploits a low-rank representation of the kernel matrix to suggest a kernel bandwidth value. Our new technique is competitive with the current state of the art for low-dimensional data and performs extremely well for many classes of high-dimensional data. Because the mathematical formulation of SVDD is identical with the mathematical formulation of one-class support vector machines (OCSVM) when the Gaussian kernel is used, our method is equally applicable to Gaussian kernel bandwidth tuning for OCSVM.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes