MLLGJan 24, 2020

Sparse Semi-supervised Heterogeneous Interbattery Bayesian Analysis

arXiv:2001.08975v115 citations
AI Analysis

This provides a more adaptable tool for machine learning practitioners dealing with complex data types, though it appears incremental as it builds upon an existing Bayesian model.

The paper tackled the problem of developing a versatile factor analysis framework for heterogeneous data, resulting in a model that outperforms most state-of-the-art algorithms across four tested scenarios.

The Bayesian approach to feature extraction, known as factor analysis (FA), has been widely studied in machine learning to obtain a latent representation of the data. An adequate selection of the probabilities and priors of these bayesian models allows the model to better adapt to the data nature (i.e. heterogeneity, sparsity), obtaining a more representative latent space. The objective of this article is to propose a general FA framework capable of modelling any problem. To do so, we start from the Bayesian Inter-Battery Factor Analysis (BIBFA) model, enhancing it with new functionalities to be able to work with heterogeneous data, include feature selection, and handle missing values as well as semi-supervised problems. The performance of the proposed model, Sparse Semi-supervised Heterogeneous Interbattery Bayesian Analysis (SSHIBA) has been tested on 4 different scenarios to evaluate each one of its novelties, showing not only a great versatility and an interpretability gain, but also outperforming most of the state-of-the-art algorithms.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes