CL | Jun 17, 2024

Retrieval-Augmented Feature Generation for Domain-Specific Classification

arXiv:2406.11177v4 | 2 citations
Originality: Incremental advance
AI Analysis

This paper addresses the need for domain-specific, explainable feature generation in classification tasks, though it appears incremental because it builds on existing feature generation and LLM techniques.

The paper tackles the problem of generating interpretable features for domain-specific classification tasks with limited data. It introduces RAFG, a retrieval-augmented method that combines knowledge retrieval with LLMs to produce features, and reports significant classification improvements across medical, economic, and geographic datasets.

Feature generation can significantly enhance learning outcomes, particularly for tasks with limited data. An effective way to improve feature generation is to expand the current feature space by using existing features and enriching their informational content. However, generating new, interpretable features usually requires domain-specific knowledge on top of the existing features. In this paper, we introduce a Retrieval-Augmented Feature Generation method, RAFG, to generate useful and explainable features specific to domain classification tasks. To increase the interpretability of the generated features, we conduct knowledge retrieval among the existing features in the domain to identify potential feature associations. These associations are expected to help generate useful features. Moreover, we develop a framework based on large language models (LLMs) for feature generation with reasoning to verify the quality of the features during their generation process. Experiments across several datasets in medical, economic, and geographic domains show that our RAFG method can produce high-quality, meaningful features and significantly improve classification performance compared with baseline methods.
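
The retrieve, generate, and verify loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the in-memory `KNOWLEDGE` list, the keyword-overlap `retrieve` function, the `stub_llm` stand-in for an LLM call, and the proposal format are all hypothetical, chosen only to make the flow concrete.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical knowledge base; the abstract does not say what source is searched.
KNOWLEDGE = [
    "BMI is weight in kilograms divided by height in metres squared.",
    "Pulse pressure is systolic minus diastolic blood pressure.",
]


def retrieve(features: List[str], corpus: List[str], k: int = 1) -> List[str]:
    """Toy retriever: rank snippets by how many feature names they mention."""
    def score(text: str) -> int:
        lowered = text.lower()
        return sum(name.lower() in lowered for name in features)

    ranked = sorted(corpus, key=score, reverse=True)
    # Keep only the top-k snippets that mention at least one feature.
    return [text for text in ranked[:k] if score(text) > 0]


def stub_llm(features: List[str], snippets: List[str]) -> Optional[dict]:
    """Stand-in for an LLM call (assumption): returns a proposal or None."""
    if any("BMI" in s for s in snippets):
        return {
            "name": "bmi",
            "inputs": ["weight", "height"],
            "reason": snippets[0],  # the "reasoning" that justifies the feature
            "fn": lambda row: row["weight"] / row["height"] ** 2,
        }
    return None


def generate_feature(
    rows: List[Dict[str, float]],
    corpus: List[str],
    propose: Callable[[List[str], List[str]], Optional[dict]],
) -> List[Dict[str, float]]:
    """Retrieve context, ask for a proposal, verify it, then add the column."""
    features = sorted(rows[0])
    snippets = retrieve(features, corpus)
    proposal = propose(features, snippets)
    # Verification step: reject proposals that use unknown columns
    # or that come without a stated reason.
    if (
        proposal is None
        or not set(proposal["inputs"]) <= set(features)
        or not proposal["reason"]
    ):
        return rows
    return [{**row, proposal["name"]: proposal["fn"](row)} for row in rows]


if __name__ == "__main__":
    data = [{"weight": 80.0, "height": 2.0}]
    print(generate_feature(data, KNOWLEDGE, stub_llm))
```

In a real setting the retriever would presumably query a domain corpus and `propose` would wrap an actual LLM call; the verification here only checks that the proposal's inputs exist and that a reason is attached, which is much weaker than the quality checks the abstract alludes to.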

