AILGOct 16, 2025

ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning

MILA
arXiv:2510.14176v26 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the problem of automating reward design for RL practitioners, offering a novel integration of foundation models with reward machines, though it is incremental in combining existing concepts.

The paper tackles the challenge of reward function specification in reinforcement learning by introducing ARM-FM, a framework that uses foundation models to automatically generate reward machines from natural language, enabling compositional reward design and showing effectiveness in diverse environments with zero-shot generalization.

Reinforcement learning (RL) algorithms are highly sensitive to reward function specification, which remains a central challenge limiting their broad applicability. We present ARM-FM: Automated Reward Machines via Foundation Models, a framework for automated, compositional reward design in RL that leverages the high-level reasoning capabilities of foundation models (FMs). Reward machines (RMs) -- an automata-based formalism for reward specification -- are used as the mechanism for RL objective specification, and are automatically constructed via the use of FMs. The structured formalism of RMs yields effective task decompositions, while the use of FMs enables objective specifications in natural language. Concretely, we (i) use FMs to automatically generate RMs from natural language specifications; (ii) associate language embeddings with each RM automata-state to enable generalization across tasks; and (iii) provide empirical evidence of ARM-FM's effectiveness in a diverse suite of challenging environments, including evidence of zero-shot generalization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes