DB · AI · CL · Dec 3, 2024

DataLab: A Unified Platform for LLM-Powered Business Intelligence

arXiv:2412.02205v3 · 13 citations · h-index: 17 · ICDE
Originality: Incremental advance
AI Analysis

DataLab addresses inefficiencies and errors in organizations' business intelligence workflows by providing a one-stop solution, though it appears incremental because it builds on existing LLM-based agent approaches.

The paper tackles the fragmentation of business intelligence tasks by introducing DataLab, a unified platform that integrates an LLM-based agent framework with a computational notebook interface, achieving up to a 58.58% increase in accuracy and a 61.65% reduction in token cost on enterprise-specific tasks.

Business intelligence (BI) transforms large volumes of data within modern organizations into actionable insights for informed decision-making. Recently, large language model (LLM)-based agents have streamlined the BI workflow by automatically performing task planning, reasoning, and actions in executable environments based on natural language (NL) queries. However, existing approaches primarily focus on individual BI tasks such as NL2SQL and NL2VIS. The fragmentation of tasks across different data roles and tools leads to inefficiencies and potential errors due to the iterative and collaborative nature of BI. In this paper, we introduce DataLab, a unified BI platform that integrates a one-stop LLM-based agent framework with an augmented computational notebook interface. DataLab supports various BI tasks for different data roles in data preparation, analysis, and visualization by seamlessly combining LLM assistance with user customization within a single environment. To achieve this unification, we design a domain knowledge incorporation module tailored for enterprise-specific BI tasks, an inter-agent communication mechanism to facilitate information sharing across the BI workflow, and a cell-based context management strategy to enhance context utilization efficiency in BI notebooks. Extensive experiments demonstrate that DataLab achieves state-of-the-art performance on various BI tasks across popular research benchmarks. Moreover, DataLab maintains high effectiveness and efficiency on real-world datasets from Tencent, achieving up to a 58.58% increase in accuracy and a 61.65% reduction in token cost on enterprise-specific BI tasks.
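The abstract does not say how the cell-based context management strategy works. As a rough illustration of the general idea only, the sketch below assumes that a notebook cell is relevant to a query cell when it defines a variable the query cell (transitively) uses, so that only those cells need to be sent to the LLM. The function names `names_defined_and_used` and `relevant_cells` are hypothetical and are not taken from DataLab.

```python
import ast


def names_defined_and_used(source):
    """Return (defined, used) variable names for one cell's source code."""
    defined, used = set(), set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Name):
            if isinstance(node.ctx, ast.Store):
                defined.add(node.id)
            else:
                used.add(node.id)
    return defined, used


def relevant_cells(cells, target_index):
    """Indices of the target cell and the earlier cells it depends on.

    Assumption (not from the paper): a dependency exists when an earlier
    cell assigns a name that a selected later cell reads.
    """
    info = [names_defined_and_used(src) for src in cells]
    needed = set(info[target_index][1])
    selected = [target_index]
    # Walk backwards so dependencies of dependencies are picked up too.
    for i in range(target_index - 1, -1, -1):
        defined, used = info[i]
        if defined & needed:
            selected.append(i)
            needed |= used
    return sorted(selected)


cells = [
    "sales = [120, 80, 200]",
    "notes = 'unrelated scratch work'",
    "total = sum(sales)",
    "print(total)",
]
print(relevant_cells(cells, 3))  # cell 1 is left out of the context
```

Under this assumption, the prompt for the last cell would carry three cells instead of four; on a large notebook the same pruning is one plausible way such a strategy could reduce token cost.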

