LG, AI, Nov 13, 2024

Searching Latent Program Spaces

arXiv:2411.08706v2, 16 citations, h-index: 1
Originality: Highly original
AI Analysis

This paper addresses inefficient adaptation in AI systems, a problem relevant to both researchers and practitioners, though its contribution is incremental in that it combines existing symbolic and neural approaches.

The paper tackles the challenge of scaling program synthesis for general intelligence by introducing the Latent Program Network (LPN), which learns a latent space of implicit programs and searches it with gradients at test time. On the ARC-AGI benchmark, enabling this test-time search doubles LPN's performance on out-of-distribution tasks.

General intelligence requires systems that acquire new skills efficiently and generalize beyond their training distributions. Although program synthesis approaches generalize strongly, they scale poorly: their large combinatorial search spaces quickly become impractical, and they require human-generated DSLs or pre-trained priors to narrow the search. Deep learning methods, on the other hand, have been highly successful, but they lack structured test-time adaptation and rely on heavy stochastic sampling or expensive gradient updates for fine-tuning. In this work, we propose the Latent Program Network (LPN), a new architecture that builds test-time search directly into neural models. LPN learns a latent space of implicit programs, each neurally mapping inputs to outputs, through which it can search using gradients at test time. LPN combines the adaptability of symbolic approaches with the scalability of neural methods. It searches through a compact latent space at test time and bypasses the need for pre-defined domain-specific languages. On a range of programming-by-examples tasks, LPN matches or outperforms in-context learning and test-time training methods. On the ARC-AGI benchmark, we demonstrate that LPN can both learn a compact program space and search through it at test time to adapt to novel tasks. LPN doubles its performance on out-of-distribution tasks when test-time search is switched on.
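
The idea of searching a latent program space with gradients at test time can be illustrated with a toy sketch. This is not the paper's architecture: the two-parameter linear `decode` function stands in for LPN's neural decoder, `z_init` stands in for an encoder's initial guess, and the names `decode`, `loss_and_grad`, and `latent_search`, along with the step count and learning rate, are assumptions made for illustration only.

```python
# Toy illustration only: a fixed linear "decoder" stands in for a learned neural decoder.

def decode(z, x):
    """Hypothetical decoder: the latent z = (a, b) selects the program y = a*x + b."""
    a, b = z
    return a * x + b

def loss_and_grad(z, pairs):
    """Mean squared error of the decoder on the example pairs, and its gradient in z."""
    n = len(pairs)
    loss = grad_a = grad_b = 0.0
    for x, y in pairs:
        err = decode(z, x) - y
        loss += err * err / n
        grad_a += 2.0 * err * x / n
        grad_b += 2.0 * err / n
    return loss, (grad_a, grad_b)

def latent_search(z_init, pairs, steps=500, lr=0.1):
    """Refine the latent by gradient descent on the example pairs (test-time search)."""
    z = z_init
    for _ in range(steps):
        _, (grad_a, grad_b) = loss_and_grad(z, pairs)
        z = (z[0] - lr * grad_a, z[1] - lr * grad_b)
    return z

# Input-output examples from an unseen task (here y = 3x - 1).
pairs = [(0.0, -1.0), (1.0, 2.0), (2.0, 5.0)]
z_init = (0.0, 0.0)                    # assumed stand-in for an encoder's first guess
z_star = latent_search(z_init, pairs)  # only the latent changes; the decoder stays fixed
prediction = decode(z_star, 4.0)       # apply the refined latent to a new input
```

In this sketch only the latent vector is updated while the decoder is left untouched, which is the distinction the abstract draws between latent search and fine-tuning; for the example pairs above, `prediction` should land close to 11.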

Code Implementations: 1 repo
