LGJan 17, 2022

ExpertNet: A Symbiosis of Classification and Clustering

Shivin Srivastava, Kenji Kawaguchi, Vaibhav Rajan

arXiv:2201.06344v11.8

Originality Incremental advance

AI Analysis

This work addresses the need for tailored classifiers in real-world applications like clinical risk models, offering an incremental improvement over existing methods.

The paper tackles the problem of improving generalization in neural models for heterogeneous data by introducing ExpertNet, which learns clustered latent representations and combines cluster-specific classifiers, resulting in superior classification performance on 6 large clinical datasets with insights into group-specific risks.

A widely used paradigm to improve the generalization performance of high-capacity neural models is through the addition of auxiliary unsupervised tasks during supervised training. Tasks such as similarity matching and input reconstruction have been shown to provide a beneficial regularizing effect by guiding representation learning. Real data often has complex underlying structures and may be composed of heterogeneous subpopulations that are not learned well with current approaches. In this work, we design ExpertNet, which uses novel training strategies to learn clustered latent representations and leverage them by effectively combining cluster-specific classifiers. We theoretically analyze the effect of clustering on its generalization gap, and empirically show that clustered latent representations from ExpertNet lead to disentangling the intrinsic structure and improvement in classification performance. ExpertNet also meets an important real-world need where classifiers need to be tailored for distinct subpopulations, such as in clinical risk models. We demonstrate the superiority of ExpertNet over state-of-the-art methods on 6 large clinical datasets, where our approach leads to valuable insights on group-specific risks.

View on arXiv PDF

Similar