CLAILGNov 17, 2024

Capturing Sparks of Abstraction for the ARC Challenge

arXiv:2411.11206v12 citationsh-index: 1Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving AI performance on abstract reasoning tasks like the ARC Prize, though it appears incremental as it builds on existing methods for code analysis.

The paper tackled the problem of low accuracy in solving ARC Challenge problems by using an LLM to extract abstract explanations from complete code solutions, resulting in reusable functional chunks and high-level tactics that could aid downstream tasks with local LLMs.

Excellent progress has been made recently in solving ARC Challenge problems. However, it seems that new techniques may be required to push beyond 60% accuracy. Even commercial Large Language Models (LLMs) struggle to 'understand' many of the problems (when given the input and output grids), which makes discovering solutions by LLM-lead program search somewhat futile. In this work, LLM 'understanding' is attempted from a stronger starting position : An LLM is given complete solutions to tasks in code, and then asked to explain how the task is being solved at various levels of abstraction. Specifically, the LLM was given code solutions implemented in arc-dsl-llm (an LLM-legible version of Hodel's arc-dsl to obtain: (a) commented code; (b) code refactored into reusable functional chunks; (c) problem solution steps; and (d) high-level problem-solving tactics. We demonstrate that 'Sparks of Abstraction' can be extracted from the LLM output - in a form that could be used in downstream tasks with Local LLMs eligible to enter the ARC Prize. Both the arc-dsl-llm DSL framework (with the re-engineered solutions) and the Gemini LLM-generated data (along with the generation code) are made Open Source.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes