CL AIJun 16, 2025

Leveraging In-Context Learning for Language Model Agents

Shivanshu Gupta, Sameer Singh, Ashish Sabharwal, Tushar Khot, Ben Bogin

arXiv:2506.13109v18.34 citationsh-index: 42

Originality Highly original

AI Analysis

This work addresses the problem of enabling effective ICL for agentic tasks, which is incremental as it builds on existing ICL methods by adapting them to a new domain.

The paper tackles the challenge of applying in-context learning (ICL) to agentic tasks requiring sequential decision-making by proposing an algorithm for automatic annotation and set-selection of demonstrations, resulting in improved performance, reliability, robustness, and efficiency for LLM agents, with ICL agents rivaling costlier trained agents.

In-context learning (ICL) with dynamically selected demonstrations combines the flexibility of prompting large language models (LLMs) with the ability to leverage training data to improve performance. While ICL has been highly successful for prediction and generation tasks, leveraging it for agentic tasks that require sequential decision making is challenging -- one must think not only about how to annotate long trajectories at scale and how to select demonstrations, but also what constitutes demonstrations, and when and where to show them. To address this, we first propose an algorithm that leverages an LLM with retries along with demonstrations to automatically and efficiently annotate agentic tasks with solution trajectories. We then show that set-selection of trajectories of similar tasks as demonstrations significantly improves performance, reliability, robustness, and efficiency of LLM agents. However, trajectory demonstrations have a large inference cost overhead. We show that this can be mitigated by using small trajectory snippets at every step instead of an additional trajectory. We find that demonstrations obtained from larger models (in the annotation phase) also improve smaller models, and that ICL agents can even rival costlier trained agents. Thus, our results reveal that ICL, with careful use, can be very powerful for agentic tasks as well.

View on arXiv PDF

Similar