CL AI CC LG NEFeb 4, 2025

Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives

arXiv:2502.04358v212.07 citationsh-index: 16ICML

Originality Incremental advance

AI Analysis

It addresses the need for scalable and efficient LLM agent systems in AI, but is incremental as it builds on existing decomposition methods by introducing a theoretical framework.

This position paper argues that decomposing problems into LLM-based agents requires asymptotic analysis with LLM primitives to optimize efficiency, proposing the LLM forward pass as the atomic cost unit to separate model internals from orchestration strategies.

Decomposing hard problems into subproblems often makes them easier and more efficient to solve. With large language models (LLMs) crossing critical reliability thresholds for a growing slate of capabilities, there is an increasing effort to decompose systems into sets of LLM-based agents, each of whom can be delegated sub-tasks. However, this decomposition (even when automated) is often intuitive, e.g., based on how a human might assign roles to members of a human team. How close are these role decompositions to optimal? This position paper argues that asymptotic analysis with LLM primitives is needed to reason about the efficiency of such decomposed systems, and that insights from such analysis will unlock opportunities for scaling them. By treating the LLM forward pass as the atomic unit of computational cost, one can separate out the (often opaque) inner workings of a particular LLM from the inherent efficiency of how a set of LLMs are orchestrated to solve hard problems. In other words, if we want to scale the deployment of LLMs to the limit, instead of anthropomorphizing LLMs, asymptotic analysis with LLM primitives should be used to reason about and develop more powerful decompositions of large problems into LLM agents.

View on arXiv PDF

Similar