CLAIJun 23, 2022

Do Trajectories Encode Verb Meaning?

arXiv:2206.11953v1628 citationsh-index: 45
Originality Incremental advance
AI Analysis

This addresses the challenge of grounding verb meaning in AI and language models, though it is incremental in scope.

The paper investigates whether object trajectories encode verb semantics, finding that raw trajectories correlate with some verbs like 'fall', and self-supervised pretraining improves capture of nuanced differences such as 'roll' vs. 'slide'.

Distributional models learn representations of words from text, but are criticized for their lack of grounding, or the linking of text to the non-linguistic world. Grounded language models have had success in learning to connect concrete categories like nouns and adjectives to the world via images and videos, but can struggle to isolate the meaning of the verbs themselves from the context in which they typically occur. In this paper, we investigate the extent to which trajectories (i.e. the position and rotation of objects over time) naturally encode verb semantics. We build a procedurally generated agent-object-interaction dataset, obtain human annotations for the verbs that occur in this data, and compare several methods for representation learning given the trajectories. We find that trajectories correlate as-is with some verbs (e.g., fall), and that additional abstraction via self-supervised pretraining can further capture nuanced differences in verb meaning (e.g., roll vs. slide).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes