MathPrompter: Mathematical Reasoning using Large Language Models

Imani, Shima; Du, Liang; Shrivastava, Harsh

Computer Science > Computation and Language

arXiv:2303.05398 (cs)

[Submitted on 4 Mar 2023]

Title:MathPrompter: Mathematical Reasoning using Large Language Models

Authors:Shima Imani, Liang Du, Harsh Shrivastava

View PDF

Abstract:Large Language Models (LLMs) have limited performance when solving arithmetic reasoning tasks and often provide incorrect answers. Unlike natural language understanding, math problems typically have a single correct answer, making the task of generating accurate solutions more challenging for LLMs. To the best of our knowledge, we are not aware of any LLMs that indicate their level of confidence in their responses which fuels a trust deficit in these models impeding their adoption. To address this deficiency, we propose `MathPrompter', a technique that improves performance of LLMs on arithmetic problems along with increased reliance in the predictions. MathPrompter uses the Zero-shot chain-of-thought prompting technique to generate multiple Algebraic expressions or Python functions to solve the same math problem in different ways and thereby raise the confidence level in the output results. This is in contrast to other prompt based CoT methods, where there is no check on the validity of the intermediate steps followed. Our technique improves over state-of-the-art on the MultiArith dataset ($78.7\%\rightarrow92.5\%$) evaluated using 175B parameter GPT-based LLM.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2303.05398 [cs.CL]
	(or arXiv:2303.05398v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2303.05398

Submission history

From: Harsh Shrivastava [view email]
[v1] Sat, 4 Mar 2023 04:43:49 UTC (306 KB)

Computer Science > Computation and Language

Title:MathPrompter: Mathematical Reasoning using Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MathPrompter: Mathematical Reasoning using Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators