CLAIJun 17, 2018

Multimodal Grounding for Language Processing

arXiv:1806.06371v21103 citations
AI Analysis

It provides a methodological overview for researchers in language processing, but is incremental as a survey.

This survey examines how multimodal processing aids in grounding language concepts, categorizing information flow and analyzing methods for combining multimodal representations, with a focus on verbs for compositional language power.

This survey discusses how recent developments in multimodal processing facilitate conceptual grounding of language. We categorize the information flow in multimodal processing with respect to cognitive models of human information processing and analyze different methods for combining multimodal representations. Based on this methodological inventory, we discuss the benefit of multimodal grounding for a variety of language processing tasks and the challenges that arise. We particularly focus on multimodal grounding of verbs which play a crucial role for the compositional power of language.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes