IR LGJul 25, 2025

Agent0: Leveraging LLM Agents to Discover Multi-value Features from Text for Enhanced Recommendations

Blaž Škrlj, Benoît Guilleminot, Andraž Tori

arXiv:2507.18993v13.6h-index: 20

Originality Incremental advance

AI Analysis

This work addresses the costly feature engineering bottleneck in large-scale recommender systems, offering an incremental improvement through automation.

The paper tackles the challenge of expensive categorical feature acquisition in recommender systems by introducing Agent0, an LLM-driven agent-based system that automates information extraction and feature construction from unstructured text, demonstrating practical and effective automated feature discovery.

Large language models (LLMs) and their associated agent-based frameworks have significantly advanced automated information extraction, a critical component of modern recommender systems. While these multitask frameworks are widely used in code generation, their application in data-centric research is still largely untapped. This paper presents Agent0, an LLM-driven, agent-based system designed to automate information extraction and feature construction from raw, unstructured text. Categorical features are crucial for large-scale recommender systems but are often expensive to acquire. Agent0 coordinates a group of interacting LLM agents to automatically identify the most valuable text aspects for subsequent tasks (such as models or AutoML pipelines). Beyond its feature engineering capabilities, Agent0 also offers an automated prompt-engineering tuning method that utilizes dynamic feedback loops from an oracle. Our findings demonstrate that this closed-loop methodology is both practical and effective for automated feature discovery, which is recognized as one of the most challenging phases in current recommender system development.

View on arXiv PDF

Similar