AI · May 29, 2025

AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning

arXiv:2505.23381v1 · 6 citations · h-index: 10
Originality: Incremental advance
AI Analysis

The paper addresses geometry problem solving for AI and offers improved reliability and interpretability, but the advance is incremental because it builds on existing neuro-symbolic approaches.

The paper tackles geometry problem solving by proposing AutoGPS, a neuro-symbolic framework that formalizes problems and performs deductive reasoning, achieving state-of-the-art performance on benchmarks with 99% stepwise logical coherence.

Geometry problem solving presents distinctive challenges in artificial intelligence, requiring exceptional multimodal comprehension and rigorous mathematical reasoning capabilities. Existing approaches typically fall into two categories, neural-based and symbolic-based methods, both of which exhibit limitations in reliability and interpretability. To address this challenge, we propose AutoGPS, a neuro-symbolic collaborative framework that solves geometry problems with concise, reliable, and human-interpretable reasoning processes. Specifically, AutoGPS employs a Multimodal Problem Formalizer (MPF) and a Deductive Symbolic Reasoner (DSR). The MPF uses neural cross-modal comprehension to translate geometry problems into structured formal-language representations, refined collaboratively through feedback from the DSR. The DSR takes the formalization as input and formulates geometry problem solving as a hypergraph expansion task, executing mathematically rigorous and reliable derivations to produce minimal and human-readable stepwise solutions. Extensive experimental evaluations demonstrate that AutoGPS achieves state-of-the-art performance on benchmark datasets. Furthermore, human stepwise-reasoning evaluation confirms AutoGPS's impressive reliability and interpretability, with 99% stepwise logical coherence. The project homepage is at https://jayce-ping.github.io/AutoGPS-homepage.
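
To make the "hypergraph expansion" idea concrete, here is a minimal sketch and not the authors' implementation. It assumes facts are plain strings (hypergraph nodes) and each theorem application is a hyperedge from a set of premise facts to one conclusion. The names `Rule`, `solve`, `FACTS`, and `RULES`, and the toy triangle problem, are all hypothetical illustrations. The sketch forward-chains until the goal is derived, then walks back from the goal so that only the steps the goal depends on are reported. This pruning drops irrelevant derivations but is not guaranteed to yield a globally minimal solution.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """A hyperedge: a set of premise facts jointly yields one conclusion."""
    name: str
    premises: frozenset
    conclusion: str


def solve(facts, rules, goal):
    """Expand the hypergraph by forward chaining, then trace back from the goal.

    Returns the ordered list of rules the goal depends on, or None if the
    goal cannot be derived from the given facts and rules.
    """
    known = set(facts)
    derived_by = {}  # fact -> the rule that first derived it
    changed = True
    while changed and goal not in known:
        changed = False
        for rule in rules:
            if rule.conclusion not in known and rule.premises <= known:
                known.add(rule.conclusion)
                derived_by[rule.conclusion] = rule
                changed = True
    if goal not in known:
        return None

    steps, seen = [], set()

    def visit(fact):
        # Given facts have no deriving rule; each derived fact is visited once.
        if fact in seen or fact not in derived_by:
            return
        seen.add(fact)
        rule = derived_by[fact]
        for premise in sorted(rule.premises):
            visit(premise)
        steps.append(rule)  # premises first, so steps come out in proof order

    visit(goal)
    return steps


# Hypothetical toy problem: triangle ABC with two known angles.
FACTS = {"triangle_ABC", "angle_A=60", "angle_B=70"}
RULES = [
    # Derivable but irrelevant to the goal, so it should be pruned.
    Rule("triangle_inequality", frozenset({"triangle_ABC"}), "AB+BC>AC"),
    Rule("angle_sum",
         frozenset({"triangle_ABC", "angle_A=60", "angle_B=70"}),
         "angle_C=50"),
    Rule("linear_pair", frozenset({"angle_C=50"}), "exterior_angle_C=130"),
]

if __name__ == "__main__":
    for step in solve(FACTS, RULES, "exterior_angle_C=130"):
        print(f"{step.name}: {sorted(step.premises)} => {step.conclusion}")
```

In this toy run, the triangle-inequality step is derived during expansion but omitted from the reported solution, which mirrors at small scale the abstract's emphasis on concise, stepwise output. A real reasoner would also need pattern matching over geometric predicates and algebraic solving, which this sketch leaves out.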

