LGApr 25, 2017

FWDA: a Fast Wishart Discriminant Analysis with its Application to Electronic Health Records Data Classification

arXiv:1704.07790v12 citations
Originality Highly original
AI Analysis

This work addresses classification challenges in Electronic Health Records for disease detection, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the problems of ill-posed parameter estimation and linear inseparability in Linear Discriminant Analysis for Electronic Health Records classification by proposing FWDA, a Fast Wishart Discriminant Analysis that uses an ensemble of classifiers with Bayesian voting, achieving superior performance over state-of-the-art methods on large-scale datasets.

Linear Discriminant Analysis (LDA) on Electronic Health Records (EHR) data is widely-used for early detection of diseases. Classical LDA for EHR data classification, however, suffers from two handicaps: the ill-posed estimation of LDA parameters (e.g., covariance matrix), and the "linear inseparability" of EHR data. To handle these two issues, in this paper, we propose a novel classifier FWDA -- Fast Wishart Discriminant Analysis, that makes predictions in an ensemble way. Specifically, FWDA first surrogates the distribution of inverse covariance matrices using a Wishart distribution estimated from the training data, then "weighted-averages" the classification results of multiple LDA classifiers parameterized by the sampled inverse covariance matrices via a Bayesian Voting scheme. The weights for voting are optimally updated to adapt each new input data, so as to enable the nonlinear classification. Theoretical analysis indicates that FWDA possesses a fast convergence rate and a robust performance on high dimensional data. Extensive experiments on large-scale EHR dataset show that our approach outperforms state-of-the-art algorithms by a large margin.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes