AIHCROJun 11, 2023

Improving Knowledge Extraction from LLMs for Task Learning through Agent Analysis

arXiv:2306.06770v48 citationsh-index: 17
Originality Incremental advance
AI Analysis

This addresses the challenge of adapting LLM knowledge for autonomous agents in real-world environments, representing an incremental improvement over prompt engineering.

The paper tackles the problem of extracting relevant, situationally grounded knowledge from LLMs for embodied agents learning novel tasks, achieving 77-94% task completion in one-shot learning without oversight and 100% with human oversight.

Large language models (LLMs) offer significant promise as a knowledge source for task learning. Prompt engineering has been shown to be effective for eliciting knowledge from an LLM, but alone it is insufficient for acquiring relevant, situationally grounded knowledge for an embodied agent learning novel tasks. We describe a cognitive-agent approach, STARS, that extends and complements prompt engineering, mitigating its limitations and thus enabling an agent to acquire new task knowledge matched to its native language capabilities, embodiment, environment, and user preferences. The STARS approach is to increase the response space of LLMs and deploy general strategies, embedded within the autonomous agent, to evaluate, repair, and select among candidate responses produced by the LLM. We describe the approach and experiments that show how an agent, by retrieving and evaluating a breadth of responses from the LLM, can achieve 77-94% task completion in one-shot learning without user oversight. The approach achieves 100% task completion when human oversight (such as an indication of preference) is provided. Further, the type of oversight largely shifts from explicit, natural language instruction to simple confirmation/discomfirmation of high-quality responses that have been vetted by the agent before presentation to a user.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes