CLAIIRLGFeb 17, 2023

Complex QA and language models hybrid architectures, Survey

arXiv:2302.09051v522 citationsh-index: 22
Originality Synthesis-oriented
AI Analysis

It addresses the problem of LLMs struggling with complex, domain-specific questions for researchers and practitioners, but is incremental as a survey.

This paper surveys hybrid architectures for complex question-answering using large language models, reviewing limitations like specialized knowledge requirements and solutions including training, prompting, and agent-based approaches.

This paper reviews the state-of-the-art of large language models (LLM) architectures and strategies for "complex" question-answering with a focus on hybrid architectures. LLM based chatbot services have allowed anyone to grasp the potential of LLM to solve many common problems, but soon discovered their limitations for complex questions. Addressing more specific, complex questions (e.g., "What is the best mix of power-generation methods to reduce climate change ?") often requires specialized architectures, domain knowledge, new skills, decomposition and multi-step resolution, deep reasoning, sensitive data protection, explainability, and human-in-the-loop processes. Therefore, we review: (1) necessary skills and tasks for handling complex questions and common LLM limits to overcome; (2) dataset, cost functions and evaluation metrics for measuring and improving (e.g. accuracy, explainability, fairness, robustness, groundedness, faithfulness, toxicity...); (3) family of solutions to overcome LLM limitations by (a) training and reinforcement (b) hybridization, (c) prompting, (d) agentic-architectures (agents, tools) and extended reasoning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes