AICLApr 15

TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

arXiv:2604.1411698.41 citationsh-index: 5
Predicted impact top 5% in AI · last 90 daysOriginality Incremental advance
AI Analysis

For researchers and practitioners, TREX automates the complex, multi-step process of LLM fine-tuning, reducing manual effort and enabling systematic exploration of training strategies.

TREX is a multi-agent system that automates the entire LLM fine-tuning lifecycle, including requirement analysis, literature research, strategy formulation, data preparation, and training/evaluation. On the FT-Bench benchmark of 10 real-world tasks, TREX consistently optimizes model performance on target tasks.

While Large Language Models (LLMs) have empowered AI research agents to perform isolated scientific tasks, automating complex, real-world workflows, such as LLM training, remains a significant challenge. In this paper, we introduce TREX, a multi-agent system that automates the entire LLM training life-cycle. By orchestrating collaboration between two core modules-the Researcher and the Executor-the system seamlessly performs requirement analysis, open-domain literature and data research, formulation of training strategies, preparation of data recipes, and model training and evaluation. The multi-round experimental process is modeled as a search tree, enabling the system to efficiently plan exploration paths, reuse historical results, and distill high-level insights from iterative trials. To evaluate the capability of automated LLM training, we construct FT-Bench, a benchmark comprising 10 tasks derived from real-world scenarios, ranging from optimizing fundamental model capabilities to enhancing performance on domain-specific tasks. Experimental results demonstrate that the TREX agent consistently optimizes model performance on target tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes