CVAIApr 18

CoGR-MoE: Concept-Guided Expert Routing with Consistent Selection and Flexible Reasoning for Visual Question Answering

arXiv:2604.1693075.2h-index: 2
Predicted impact top 35% in CV · last 90 daysOriginality Synthesis-oriented
AI Analysis

For VQA researchers, this addresses the trade-off between routing stability and flexibility in MoE, but results are reported without concrete numbers, making the improvement unclear.

CoGR-MoE improves Visual Question Answering by using answer option semantics to guide expert selection in Mixture-of-Experts, achieving strong performance across multiple VQA tasks.

Visual Question Answering (VQA) requires models to identify the correct answer options based on both visual and textual evidence. Recent Mixture-of-Experts (MoE) methods improve option reasoning by grouping similar concepts or routing based on examples. However, unstable routing can lead to inconsistent expert selection in the same question type, while overly stable routing may reduce flexibility. To address this, we propose Concept-Guided Routing framework (CoGR-MoE), which incorporates semantics of the answer options to guide expert selection in the training phase. Next, option features are used to reweight the selected experts, producing discriminative representations for each candidate option. These option-level representations are further used for option comparison and optimized via contrastive learning. The experimental results indicate that CoGR-MoE delivers strong performance across multiple VQA tasks, demonstrating the effectiveness of our approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes