CLAIDec 18, 2025

Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs

arXiv:2512.16814v16 citationsh-index: 14ICML
Originality Incremental advance
AI Analysis

This addresses the challenge of accurate and efficient translation for robotics and autonomous systems, though it is incremental as it builds on existing decomposition methods.

The paper tackles the problem of translating natural language to temporal logic for human-robot communication by proposing Grammar Forced Translation (GraFT), which restricts output tokens to improve accuracy, resulting in a 5.49% increase in end-to-end translation accuracy and 14.06% in out-of-domain accuracy on benchmarks.

Translating natural language (NL) into a formal language such as temporal logic (TL) is integral for human communication with robots and autonomous systems. State-of-the-art approaches decompose the task into a lifting of atomic propositions (APs) phase and a translation phase. However, existing methods struggle with accurate lifting, the existence of co-references, and learning from limited data. In this paper, we propose a framework for NL to TL translation called Grammar Forced Translation (GraFT). The framework is based on the observation that previous work solves both the lifting and translation steps by letting a language model iteratively predict tokens from its full vocabulary. In contrast, GraFT reduces the complexity of both tasks by restricting the set of valid output tokens from the full vocabulary to only a handful in each step. The solution space reduction is obtained by exploiting the unique properties of each problem. We also provide a theoretical justification for why the solution space reduction leads to more efficient learning. We evaluate the effectiveness of GraFT using the CW, GLTL, and Navi benchmarks. Compared with state-of-the-art translation approaches, it can be observed that GraFT the end-to-end translation accuracy by 5.49% and out-of-domain translation accuracy by 14.06% on average.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes