CLAIMar 14, 2024

Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance

arXiv:2403.09085v212 citationsHas CodeNIPS
AI Analysis

This addresses a key limitation in LLMs' ability to reason abstractly, which is important for advancing AI towards more human-like intelligence, though it is incremental in nature.

The paper tackles the problem of large language models (LLMs) struggling with abstract reasoning when using generic facts, revealing a substantial performance gap. It introduces a dataset and learning paradigm that boosts general reasoning and improves abstract reasoning capabilities.

Large language models (LLMs) have developed impressive performance and strong explainability across various reasoning scenarios, marking a significant stride towards mimicking human-like intelligence. Despite this, when tasked with several simple questions supported by a generic fact, LLMs often struggle to abstract and apply the generic fact to provide consistent and precise answers, revealing a deficiency in abstract reasoning abilities. This has sparked a vigorous debate about whether LLMs are genuinely reasoning or merely memorizing. In light of this, we design a preliminary study to quantify and delve into the abstract reasoning abilities of existing LLMs. Our findings reveal a substantial discrepancy between their general reasoning and abstract reasoning performances. To relieve this problem, we tailor an abstract reasoning dataset (AbsR) together with a meaningful learning paradigm to teach LLMs how to leverage generic facts for reasoning purposes. The results show that our approach not only boosts the general reasoning performance of LLMs but also makes considerable strides towards their capacity for abstract reasoning, moving beyond simple memorization or imitation to a more nuanced understanding and application of generic facts. The code is available at https://github.com/Waste-Wood/MeanLearn.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes