LG CLSep 16, 2024

Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles

Kulin Shah, Nishanth Dikkala, Xin Wang, Rina Panigrahy

arXiv:2409.10502v124.535 citationsh-index: 17Has Code

Originality Incremental advance

AI Analysis

This work addresses the ongoing debate about the emergence of fundamental search and reasoning capabilities in LLMs, with implications for AI research on logical problem-solving.

The study tackled the problem of whether causal language modeling can learn complex search and reasoning tasks, specifically solving Sudoku and Zebra puzzles, and found that Transformer models trained on logical sequences achieved high accuracy, solving 94.21% of Sudoku puzzles and 92.04% of Zebra puzzles correctly.

Causal language modeling using the Transformer architecture has yielded remarkable capabilities in Large Language Models (LLMs) over the last few years. However, the extent to which fundamental search and reasoning capabilities emerged within LLMs remains a topic of ongoing debate. In this work, we study if causal language modeling can learn a complex task such as solving Sudoku puzzles. To solve a Sudoku, the model is first required to search over all empty cells of the puzzle to decide on a cell to fill and then apply an appropriate strategy to fill the decided cell. Sometimes, the application of a strategy only results in thinning down the possible values in a cell rather than concluding the exact value of the cell. In such cases, multiple strategies are applied one after the other to fill a single cell. We observe that Transformer models trained on this synthetic task can indeed learn to solve Sudokus (our model solves $94.21\%$ of the puzzles fully correctly) when trained on a logical sequence of steps taken by a solver. We find that training Transformers with the logical sequence of steps is necessary and without such training, they fail to learn Sudoku. We also extend our analysis to Zebra puzzles (known as Einstein puzzles) and show that the model solves $92.04 \%$ of the puzzles fully correctly. In addition, we study the internal representations of the trained Transformer and find that through linear probing, we can decode information about the set of possible values in any given cell from them, pointing to the presence of a strong reasoning engine implicit in the Transformer weights.

View on arXiv PDF Code

Similar