LGAIJan 23, 2023

Feature construction using explanations of individual predictions

arXiv:2301.09631v113 citationsh-index: 27
Originality Highly original
AI Analysis

This addresses the challenge of efficient and interpretable feature construction for machine learning practitioners, offering a novel method that reduces computational time while maintaining or improving model performance.

The paper tackles the problem of feature construction being time-consuming by proposing a heuristic approach that aggregates instance-based explanations to reduce the search space, showing significant improvements in classification accuracy on 30 real-world datasets and generating interpretable features validated by a domain expert.

Feature construction can contribute to comprehensibility and performance of machine learning models. Unfortunately, it usually requires exhaustive search in the attribute space or time-consuming human involvement to generate meaningful features. We propose a novel heuristic approach for reducing the search space based on aggregation of instance-based explanations of predictive models. The proposed Explainable Feature Construction (EFC) methodology identifies groups of co-occurring attributes exposed by popular explanation methods, such as IME and SHAP. We empirically show that reducing the search to these groups significantly reduces the time of feature construction using logical, relational, Cartesian, numerical, and threshold num-of-N and X-of-N constructive operators. An analysis on 10 transparent synthetic datasets shows that EFC effectively identifies informative groups of attributes and constructs relevant features. Using 30 real-world classification datasets, we show significant improvements in classification accuracy for several classifiers and demonstrate the feasibility of the proposed feature construction even for large datasets. Finally, EFC generated interpretable features on a real-world problem from the financial industry, which were confirmed by a domain expert.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes