CLAILGOct 20, 2023

AllTogether: Investigating the Efficacy of Spliced Prompt for Web Navigation using Large Language Models

arXiv:2310.18331v22 citationsh-index: 2Has Code
AI Analysis

This work addresses the efficiency of spliced prompts for LLM-driven web agents, providing incremental insights for future research in this domain.

The paper tackled the problem of improving large language models' performance in web navigation tasks by introducing AllTogether, a standardized prompt template that enhances task context representation, resulting in findings that GPT-4 outperforms smaller models and that HTML snippet length and history trajectory significantly influence performance.

Large Language Models (LLMs) have emerged as promising agents for web navigation tasks, interpreting objectives and interacting with web pages. However, the efficiency of spliced prompts for such tasks remains underexplored. We introduces AllTogether, a standardized prompt template that enhances task context representation, thereby improving LLMs' performance in HTML-based web navigation. We evaluate the efficacy of this approach through prompt learning and instruction finetuning based on open-source Llama-2 and API-accessible GPT models. Our results reveal that models like GPT-4 outperform smaller models in web navigation tasks. Additionally, we find that the length of HTML snippet and history trajectory significantly influence performance, and prior step-by-step instructions prove less effective than real-time environmental feedback. Overall, we believe our work provides valuable insights for future research in LLM-driven web agents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes