CLMLJun 2, 2023

MathChat: Converse to Tackle Challenging Math Problems with LLM Agents

arXiv:2306.01337v371 citationsh-index: 48
Originality Incremental advance
AI Analysis

This work addresses the problem of solving complex mathematical problems expressed in natural language for fields like science and engineering, representing an incremental advancement in LLM-based agents.

The paper tackles the challenge of solving difficult math problems by introducing MathChat, a conversational framework where an LLM agent and a user proxy agent collaborate through dialogue, achieving a 6% improvement over previous tool-using prompting methods on high school competition problems from the MATH dataset.

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields. LLMs, with their generalized ability, are used as a foundation model to build AI agents for different tasks. In this paper, we study the effectiveness of utilizing LLM agents to solve math problems through conversations. We propose MathChat, a conversational problem-solving framework designed for math problems. MathChat consists of an LLM agent and a user proxy agent which is responsible for tool execution and additional guidance. This synergy facilitates a collaborative problem-solving process, where the agents engage in a dialogue to solve the problems. We perform evaluation on difficult high school competition problems from the MATH dataset. Utilizing Python, we show that MathChat can further improve previous tool-using prompting methods by 6%.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes