HC AINov 10, 2025

NoteEx: Interactive Visual Context Manipulation for LLM-Assisted Exploratory Data Analysis in Computational Notebooks

Mohammad Hasan Payandeh, Lin-Ping Yuan, Jian Zhao

arXiv:2511.07223v14.1h-index: 2

Originality Incremental advance

AI Analysis

This addresses the issue of inefficient LLM context selection for data analysts in computational notebooks, though it is incremental as it builds on existing LLM-assisted EDA tools.

The paper tackled the problem of users struggling to select appropriate context for LLM assistance in exploratory data analysis within computational notebooks, by introducing NoteEx, a tool that improved mental model retention and context selection, leading to more accurate and relevant LLM responses in a user study with 12 participants.

Computational notebooks have become popular for Exploratory Data Analysis (EDA), augmented by LLM-based code generation and result interpretation. Effective LLM assistance hinges on selecting informative context -- the minimal set of cells whose code, data, or outputs suffice to answer a prompt. As notebooks grow long and messy, users can lose track of the mental model of their analysis. They thus fail to curate appropriate contexts for LLM tasks, causing frustration and tedious prompt engineering. We conducted a formative study (n=6) that surfaced challenges in LLM context selection and mental model maintenance. Therefore, we introduce NoteEx, a JupyterLab extension that provides a semantic visualization of the EDA workflow, allowing analysts to externalize their mental model, specify analysis dependencies, and enable interactive selection of task-relevant contexts for LLMs. A user study (n=12) against a baseline shows that NoteEx improved mental model retention and context selection, leading to more accurate and relevant LLM responses.

View on arXiv PDF

Similar