CVNov 26, 2024

Diagram-Driven Course Questions Generation

arXiv:2411.17771v52 citationsh-index: 18EMNLP
Originality Incremental advance
AI Analysis

This addresses the need for automated pedagogical assessment tools in education by focusing on diagrams, though it is incremental as it adapts existing vision-language models to a new domain.

The paper tackles the problem of generating educational questions from diagrams, which are underrepresented in visual question generation research, by introducing the Diagram-Driven Course Questions Generation task and a dataset of 15,720 diagrams and 25,798 questions across 37 subjects. Their Hierarchical Knowledge Integration framework outperforms existing models on this dataset while maintaining generalizability to natural images.

Visual Question Generation (VQG) research focuses predominantly on natural images while neglecting the diagram, which is a critical component in educational materials. To meet the needs of pedagogical assessment, we propose the Diagram-Driven Course Questions Generation (DDCQG) task and construct DiagramQG, a comprehensive dataset with 15,720 diagrams and 25,798 questions across 37 subjects and 371 courses. Our approach employs course and input text constraints to generate course-relevant questions about specific diagram elements. We reveal three challenges of DDCQG: domain-specific knowledge requirements across courses, long-tail distribution in course coverage, and high information density in diagrams. To address these, we propose the Hierarchical Knowledge Integration framework (HKI-DDCQG), which utilizes trainable CLIP for identifying relevant diagram patches, leverages frozen vision-language models for knowledge extraction, and generates questions with trainable T5. Experiments demonstrate that HKI-DDCQG outperforms existing models on DiagramQG while maintaining strong generalizability across natural image datasets, establishing a strong baseline for DDCQG.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes