LGMay 19, 2025

Dual-Agent Reinforcement Learning for Automated Feature Generation

arXiv:2505.12628v1h-index: 7Has CodeIJCAI
Originality Incremental advance
AI Analysis

This work addresses feature generation challenges in machine learning for tabular data, representing an incremental improvement over existing reinforcement learning methods.

The paper tackles the problem of generating redundant features and poor state representation in automated feature generation by proposing a dual-agent reinforcement learning method with self-attention and diverse operations, resulting in effective performance on multiple datasets.

Feature generation involves creating new features from raw data to capture complex relationships among the original features, improving model robustness and machine learning performance. Current methods using reinforcement learning for feature generation have made feature exploration more flexible and efficient. However, several challenges remain: first, during feature expansion, a large number of redundant features are generated. When removing them, current methods only retain the best features each round, neglecting those that perform poorly initially but could improve later. Second, the state representation used by current methods fails to fully capture complex feature relationships. Third, there are significant differences between discrete and continuous features in tabular data, requiring different operations for each type. To address these challenges, we propose a novel dual-agent reinforcement learning method for feature generation. Two agents are designed: the first generates new features, and the second determines whether they should be preserved. A self-attention mechanism enhances state representation, and diverse operations distinguish interactions between discrete and continuous features. The experimental results on multiple datasets demonstrate that the proposed method is effective. The code is available at https://github.com/extess0/DARL.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes