LGAIMLJan 8, 2022

Attention-based Random Forest and Contamination Model

arXiv:2201.02880v142 citations
AI Analysis

This work addresses incremental improvements in machine learning models for regression and classification tasks, potentially benefiting researchers in ensemble methods.

The authors tackled the problem of improving random forest performance by integrating attention mechanisms, resulting in three modifications that assign trainable attention weights to decision trees based on instance distances within leaves.

A new approach called ABRF (the attention-based random forest) and its modifications for applying the attention mechanism to the random forest (RF) for regression and classification are proposed. The main idea behind the proposed ABRF models is to assign attention weights with trainable parameters to decision trees in a specific way. The weights depend on the distance between an instance, which falls into a corresponding leaf of a tree, and instances, which fall in the same leaf. This idea stems from representation of the Nadaraya-Watson kernel regression in the form of a RF. Three modifications of the general approach are proposed. The first one is based on applying the Huber's contamination model and on computing the attention weights by solving quadratic or linear optimization problems. The second and the third modifications use the gradient-based algorithms for computing trainable parameters. Numerical experiments with various regression and classification datasets illustrate the proposed method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes