CLAIDec 3, 2024

QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing

arXiv:2412.02638v12 citationsh-index: 27Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses process task guidance for technicians in manufacturing, but it is incremental as it focuses on evaluating existing LLMs rather than introducing new methods.

The paper tackled the problem of using LLMs for data augmentation in manufacturing task guidance by evaluating several open-source models on a dataset of over 200,000 question/answer pairs grounded in procedure documents and demonstrations, finding that the task is complex and requiring temporal understanding.

In this work we explore utilizing LLMs for data augmentation for manufacturing task guidance system. The dataset consists of representative samples of interactions with technicians working in an advanced manufacturing setting. The purpose of this work to explore the task, data augmentation for the supported tasks and evaluating the performance of the existing LLMs. We observe that that task is complex requiring understanding from procedure specification documents, actions and objects sequenced temporally. The dataset consists of 200,000+ question/answer pairs that refer to the spec document and are grounded in narrations and/or video demonstrations. We compared the performance of several popular open-sourced LLMs by developing a baseline using each LLM and then compared the responses in a reference-free setting using LLM-as-a-judge and compared the ratings with crowd-workers whilst validating the ratings with experts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes