CLAIJul 20, 2024

I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation

arXiv:2407.14767v229 citationsh-index: 8Has Code
AI Analysis

It addresses the problem of improving LLM reliability in real-world applications by balancing performance gains with user burden, though it is incremental as it focuses on a specific case study.

This study investigates whether large language models (LLMs) can proactively ask for user support in text-to-SQL generation, finding that without external feedback, many LLMs struggle to recognize when they need help.

This study explores the proactive ability of LLMs to seek user support. We propose metrics to evaluate the trade-off between performance improvements and user burden, and investigate whether LLMs can determine when to request help under varying information availability. Our experiments show that without external feedback, many LLMs struggle to recognize their need for user support. The findings highlight the importance of external signals and provide insights for future research on improving support-seeking strategies. Source code: https://github.com/appier-research/i-need-help

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes