CVMar 15, 2024

Mitigating Dialogue Hallucination for Large Vision Language Models via Adversarial Instruction Tuning

arXiv:2403.10492v35 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses reliability issues for LVLMs used as general-purpose assistants, though it is incremental as it builds on existing methods for hallucination mitigation.

The paper tackles the problem of dialogue-induced hallucinations in Large Vision Language Models (LVLMs) by introducing an Adversarial Question Generator to create a benchmark and proposing Adversarial Instruction Tuning to mitigate bias, resulting in reduced hallucinations while maintaining performance.

Mitigating hallucinations of Large Vision Language Models,(LVLMs) is crucial to enhance their reliability for general-purpose assistants. This paper shows that such hallucinations of LVLMs can be significantly exacerbated by preceding user-system dialogues. To precisely measure this, we first present an evaluation benchmark by extending popular multi-modal benchmark datasets with prepended hallucinatory dialogues powered by our novel Adversarial Question Generator (AQG), which can automatically generate image-related yet adversarial dialogues by adopting adversarial attacks on LVLMs. On our benchmark, the zero-shot performance of state-of-the-art LVLMs drops significantly for both the VQA and Captioning tasks. Next, we further reveal this hallucination is mainly due to the prediction bias toward preceding dialogues rather than visual content. To reduce this bias, we propose Adversarial Instruction Tuning (AIT) that robustly fine-tunes LVLMs against hallucinatory dialogues. Extensive experiments show our proposed approach successfully reduces dialogue hallucination while maintaining performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes