AICLCVCYHCFeb 6, 2024

Enhancing Cross-Modal Contextual Congruence for Crowdfunding Success using Knowledge-infused Learning

arXiv:2402.03607v22 citationsh-index: 25BigData
Originality Incremental advance
AI Analysis

This work addresses the problem of enhancing user engagement and success in crowdfunding campaigns for creators and marketers by improving contextual congruence in multimodal content, though it is incremental as it builds on existing visual language models with knowledge infusion.

The paper tackled the challenge of unifying semantic contextual cues across text and image modalities in multimodal content to predict crowdfunding campaign success, and found that incorporating external commonsense knowledge from knowledge graphs improved predictive performance over baselines without knowledge.

The digital landscape continually evolves with multimodality, enriching the online experience for users. Creators and marketers aim to weave subtle contextual cues from various modalities into congruent content to engage users with a harmonious message. This interplay of multimodal cues is often a crucial factor in attracting users' attention. However, this richness of multimodality presents a challenge to computational modeling, as the semantic contextual cues spanning across modalities need to be unified to capture the true holistic meaning of the multimodal content. This contextual meaning is critical in attracting user engagement as it conveys the intended message of the brand or the organization. In this work, we incorporate external commonsense knowledge from knowledge graphs to enhance the representation of multimodal data using compact Visual Language Models (VLMs) and predict the success of multi-modal crowdfunding campaigns. Our results show that external knowledge commonsense bridges the semantic gap between text and image modalities, and the enhanced knowledge-infused representations improve the predictive performance of models for campaign success upon the baselines without knowledge. Our findings highlight the significance of contextual congruence in online multimodal content for engaging and successful crowdfunding campaigns.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes