Mathematical Reasoning via Multi-step Self Questioning and Answering for Small Language Models

被引：0

作者：

Chen, Kaiyuan ^{[1
]}

Wang, Jin ^{[1
]}

Zhang, Xuejie ^{[1
]}

机构：

[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming, Yunnan, Peoples R China

来源：

NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT IV, NLPCC 2024 | 2025年 / 15362卷

基金：

中国国家自然科学基金;

关键词：

Mathematical Reasoning; Knowledge Distillation; Small Language Models;

D O I：

10.1007/978-981-97-9440-9_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Mathematical reasoning is challenging for large language models (LLMs), while the scaling relationship concerning LLM capacity is under-explored. Existing works have tried to leverage the rationales of LLMs to train small language models (SLMs) for enhanced reasoning abilities, referred to as distillation. However, most existing distillation methods have not considered guiding the small models to solve problems progressively from simple to complex, which can be a more effective way. This study proposes a multi-step self questioning and answering (M-SQA) method that guides SLMs to solve complex problems by starting from simple ones. Initially, multi-step self-questioning and answering rationales are extracted from LLMs based on complexity-based prompting. Subsequently, these rationales are employed for distilling SLMs in a multi-task learning framework, during which the model learns to multi-step reason in a self questioning and answering way and answer each sub-question in a single step iteratively. Experiments on current mathematical reasoning tasks demonstrate the effectiveness of the proposed approach.

引用

页码：81 / 93

页数：13

共 50 条

[31] Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts
Lan, Yunshi
Li, Xiang
Liu, Xin
Li, Yang
Qin, Wei
Qian, Weining
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4389 - 4400
[32] Multi-step learning and underlying structure in statistical models
Fraser, Maia
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[33] QUESTION ANSWERING SYSTEM ON MATHEMATICAL-MODELS (QAS) - DESCRIPTION OF LANGUAGE
KONOPASEK, M
PAPACONSTADOPOULOS, C
COMPUTER LANGUAGES, 1978, 3 (03): : 145 - 155
[34] QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
Yasunaga, Michihiro
Ren, Hongyu
Bosselut, Antoine
Liang, Percy
Leskovec, Jure
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 535 - 546
[35] JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
Sun, Yueqing
Shi, Qi
Qi, Le
Zhang, Yu
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5049 - 5060
[36] Multi-Agent Reinforcement Learning with Multi-Step Generative Models
Krupnik, Orr
Mordatch, Igor
Tamar, Aviv
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[37] QAGCN: Answering Multi-relation Questions via Single-Step Implicit Reasoning over Knowledge Graphs
Wang, Ruijie
Rossetto, Luca
Cochez, Michael
Bernstein, Abraham
SEMANTIC WEB, PT I, ESWC 2024, 2024, 14664 : 41 - 58
[38] Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
Xi, Zhiheng
Jin, Senjie
Zhou, Yuhao
Zheng, Rui
Gao, Songyang
Gu, Tao
Zhang, Qi
Huang, Xuanjing
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11383 - 11406
[39] A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
Stolfo, Alessandro
Jin, Zhijing
Shridhar, Kumar
Scholkopf, Bernhard
Sachan, Mrinmaya
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 545 - 561
[40] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
Guo, Pei
You, Wangjie
Li, Juntao
Yan, Bowen
Zhang, Min
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685

← 1 2 3 4 5 →