LGMEMLOct 28, 2019

Variable Selection with Copula Entropy

arXiv:1910.12389v216 citations
Originality Incremental advance
AI Analysis

This addresses the problem of building explainable models in machine learning and statistics, though it appears incremental as it builds on existing variable selection techniques.

The paper tackled variable selection for classification and regression by proposing a Copula Entropy (CE) based method, which was shown to select more effective and interpretable variables without sacrificing accuracy compared to traditional methods like Distance Correlation and LASSO on UCI heart disease data.

Variable selection is of significant importance for classification and regression tasks in machine learning and statistical applications where both predictability and explainability are needed. In this paper, a Copula Entropy (CE) based method for variable selection which use CE based ranks to select variables is proposed. The method is both model-free and tuning-free. Comparison experiments between the proposed method and traditional variable selection methods, such as Distance Correlation, Hilbert-Schmidt Independence Criterion, Stepwise Selection, regularized generalized linear models and Adaptive LASSO, were conducted on the UCI heart disease data. Experimental results show that CE based method can select the `right' variables out more effectively and derive better interpretable results than traditional methods do without sacrificing accuracy performance. It is believed that CE based variable selection can help to build more explainable models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes