SE · Apr 21

From Task to Tutorial: An Automated GUI Framework for Excel Tutorial Document and Video Creation

arXiv:2509.21816 · 34.3 · h-index: 30
Predicted impact: top 7% in SE · last 90 days · Originality: Incremental advance
AI Analysis

For Excel users and tutorial creators, this framework automates the labor-intensive process of tutorial generation, enabling scalable and high-quality tutorials that approach expert-authored quality.

This paper presents the first framework for automatically generating Excel tutorials from natural language task descriptions, achieving an 8.5% improvement in task execution success rate over state-of-the-art baselines and reducing time costs to 1/20 of expert authoring.

Excel is one of the most widely used productivity tools across domains, offering rich functionality but also overwhelming users with its complexity. This creates a persistent demand for tutorials to support effective usage. However, while building and maintaining the Microsoft tutorial corpus, we observed that existing tutorials are manually created by experts, need frequent updates with each software release, and involve substantial human labor. Moreover, prior work has not achieved fully automated tutorial generation. In this paper, we present the first framework for automatically generating Excel tutorials directly from natural language task descriptions. Our framework first instantiates the task. Then the Execution Agent, a central component of the framework, plans and executes the solution in Excel and collects the intermediate artifacts required for tutorial construction. These artifacts are then transformed into both structured Excel documents and video demonstrations. To build a comprehensive tutorial corpus, we collected 1,559 task descriptions from real-world scenarios. In addition, we designed a systematic evaluation framework that integrates assessments from both large language models (LLMs) and human reviewers. Experimental results show that our framework improves task execution success rates by 8.5% over state-of-the-art baselines. Moreover, the generated tutorials demonstrate superior readability and instructional effectiveness, often approaching or surpassing expert-authored materials. Importantly, the automated pipeline eliminates manual labor and reduces time costs to 1/20 of expert authoring, making scalable and high-quality tutorial generation practical for the first time.

