CVAILGJan 11, 2023

Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks

arXiv:2302.05330v17 citationsh-index: 15
Originality Highly original
AI Analysis

This work addresses the problem of enabling AI systems to understand and guide humans in procedural tasks, such as at-home repairs, with incremental improvements over existing methods.

The paper tackles the problem of extracting structured task representations from video demonstrations and narrations of procedural tasks, such as changing a tire, and demonstrates substantial gains, including a 30.1% improvement in task tracking accuracy and a 20.3% accuracy gain in next action prediction compared to state-of-the-art methods on the CrossTask dataset.

Given video demonstrations and paired narrations of an at-home procedural task such as changing a tire, we present an approach to extract the underlying task structure -- relevant actions and their temporal dependencies -- via action-centric task graphs. Learnt structured representations from our method, Action Dynamics Task Graphs (ADTG), can then be used for understanding such tasks in unseen videos of humans performing them. Furthermore, ADTG can enable providing user-centric guidance to humans in these tasks, either for performing them better or for learning new tasks. Specifically, we show how ADTG can be used for: (1) tracking an ongoing task, (2) recommending next actions, and (3) planning a sequence of actions to accomplish a procedural task. We compare against state-of-the-art Neural Task Graph method and demonstrate substantial gains on 18 procedural tasks from the CrossTask dataset, including 30.1% improvement in task tracking accuracy and 20.3% accuracy gain in next action prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes