AIJun 2

Distilling Answer-Set Programming Rules from LLMs for Neurosymbolic Visual Question Answering

Thomas Eiter, Nelson Higuera Ruiz, Johannes Oetsch

arXiv:2606.0326913.1

Predicted impact top 74% in AI · last 90 daysOriginality Incremental advance

AI Analysis

For developers of neurosymbolic VQA systems, this reduces the burden of manually adapting logic-based reasoning components when task requirements change.

The paper presents a method for distilling answer-set programming rules from LLMs to extend reasoning theories for VQA, achieving effective rule generation with only a few examples across diverse datasets.

Visual Question Answering (VQA) is the task of answering questions about images, requiring the integration of multimodal input and reasoning. Modular approaches that incorporate logic-based representations into the reasoning component offer clear advantages over end-to-end trained systems, particularly in terms of interpretability. However, adapting or extending these representations when task requirements change can place a significant burden on developers. To address this challenge, we present an approach for distilling rules from Large Language Models (LLMs). Our method prompts an LLM to extend an initial VQA reasoning theory, expressed as an answer-set program, to meet new requirements of the task. Examples from VQA datasets guide the LLM, validate the results, and help correct erroneous rules by leveraging feedback from the ASP solver. We demonstrate that our approach is effective across diverse VQA datasets. Notably, only a few examples are needed to elicit correct rules from LLMs. Our experiments suggest that rule distillation from LLMs is a promising alternative to traditional data-driven rule learning approaches. Under consideration in Theory and Practice of Logic Programming (TPLP).

View on arXiv PDF

Similar