CVDec 23, 2024

EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities

arXiv:2412.17677v15.22 citationsh-index: 3Has Code

Originality Highly original

AI Analysis

This addresses a common challenge in real-world multimodal learning for applications where data may be incomplete, offering a more efficient solution than previous approaches.

The paper tackles the problem of missing modalities in multimodal learning by proposing EPE-P, a parameter-efficient prompting method that reduces complexity and redundant parameters while handling uncertainty with an evidence-based loss, achieving superior effectiveness and efficiency compared to existing methods.

Missing modalities are a common challenge in real-world multimodal learning scenarios, occurring during both training and testing. Existing methods for managing missing modalities often require the design of separate prompts for each modality or missing case, leading to complex designs and a substantial increase in the number of parameters to be learned. As the number of modalities grows, these methods become increasingly inefficient due to parameter redundancy. To address these issues, we propose Evidence-based Parameter-Efficient Prompting (EPE-P), a novel and parameter-efficient method for pretrained multimodal networks. Our approach introduces a streamlined design that integrates prompting information across different modalities, reducing complexity and mitigating redundant parameters. Furthermore, we propose an Evidence-based Loss function to better handle the uncertainty associated with missing modalities, improving the model's decision-making. Our experiments demonstrate that EPE-P outperforms existing prompting-based methods in terms of both effectiveness and efficiency. The code is released at https://github.com/Boris-Jobs/EPE-P_MLLMs-Robustness.

View on arXiv PDF Code

Similar