AICLApr 15, 2025

Enhancing multimodal analogical reasoning with Logic Augmented Generation

arXiv:2504.11190v21 citationsh-index: 26
Originality Incremental advance
AI Analysis

This work addresses the problem of implicit knowledge extraction for AI systems, offering an incremental improvement in multimodal reasoning tasks.

The paper tackled the challenge of extracting implicit knowledge from natural language by using a logic-augmented generation framework with semantic knowledge graphs to enhance multimodal analogical reasoning, achieving results that surpass current baselines and outperform humans in understanding visual metaphors.

Recent advances in Large Language Models have demonstrated their capabilities across a variety of tasks. However, automatically extracting implicit knowledge from natural language remains a significant challenge, as machines lack active experience with the physical world. Given this scenario, semantic knowledge graphs can serve as conceptual spaces that guide the automated text generation reasoning process to achieve more efficient and explainable results. In this paper, we apply a logic-augmented generation (LAG) framework that leverages the explicit representation of a text through a semantic knowledge graph and applies it in combination with prompt heuristics to elicit implicit analogical connections. This method generates extended knowledge graph triples representing implicit meaning, enabling systems to reason on unlabeled multimodal data regardless of the domain. We validate our work through three metaphor detection and understanding tasks across four datasets, as they require deep analogical reasoning capabilities. The results show that this integrated approach surpasses current baselines, performs better than humans in understanding visual metaphors, and enables more explainable reasoning processes, though still has inherent limitations in metaphor understanding, especially for domain-specific metaphors. Furthermore, we propose a thorough error analysis, discussing issues with metaphorical annotations and current evaluation methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes