AIOct 15, 2024

Implementing Derivations of Definite Logic Programs with Self-Attention Networks

arXiv:2410.11396v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This work addresses the challenge of understanding and enhancing the logical reasoning abilities of large language models, which is incremental as it builds on existing transformer architectures.

The paper tackles the problem of implementing logical inference with self-attention networks, showing that hierarchical self-attention networks with feed-forward networks can implement both top-down and bottom-up derivations for a class of logical formulae, demonstrating that LLMs implicitly have logical inference capabilities.

In this paper we propose that a restricted version of logical inference can be implemented with self-attention networks. We are aiming at showing that LLMs (Large Language Models) constructed with transformer networks can make logical inferences. We would reveal the potential of LLMs by analyzing self-attention networks, which are main components of transformer networks. Our approach is not based on semantics of natural languages but operations of logical inference. %point of view. We show that hierarchical constructions of self-attention networks with feed forward networks (FFNs) can implement top-down derivations for a class of logical formulae. We also show bottom-up derivations are also implemented for the same class. We believe that our results show that LLMs implicitly have the power of logical inference.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes