Mathematical Reasoning via Multi-step Self Questioning and Answering for Small Language Models

Cited by: 0
Authors
Chen, Kaiyuan [1]
Wang, Jin [1]
Zhang, Xuejie [1]
Affiliations
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming, Yunnan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Mathematical Reasoning; Knowledge Distillation; Small Language Models;
DOI
10.1007/978-981-97-9440-9_7
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Mathematical reasoning is challenging for large language models (LLMs), while the scaling relationship with respect to LLM capacity remains under-explored. Existing work has tried to leverage rationales generated by LLMs to train small language models (SLMs) for enhanced reasoning abilities, an approach referred to as distillation. However, most existing distillation methods do not guide the small models to solve problems progressively from simple to complex, which can be more effective. This study proposes a multi-step self-questioning and answering (M-SQA) method that guides SLMs to solve complex problems by starting from simple ones. First, multi-step self-questioning and answering rationales are extracted from LLMs using complexity-based prompting. These rationales are then used to distill SLMs in a multi-task learning framework, in which the model learns both to reason in multiple steps through self-questioning and answering and to answer each sub-question in a single step, iteratively. Experiments on current mathematical reasoning tasks demonstrate the effectiveness of the proposed approach.
Pages: 81-93
Page count: 13
Related papers
50 in total
  • [31] Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts
    Lan, Yunshi
    Li, Xiang
    Liu, Xin
    Li, Yang
    Qin, Wei
    Qian, Weining
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4389 - 4400
  • [32] Multi-step learning and underlying structure in statistical models
    Fraser, Maia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [33] QUESTION ANSWERING SYSTEM ON MATHEMATICAL-MODELS (QAS) - DESCRIPTION OF LANGUAGE
    KONOPASEK, M
    PAPACONSTADOPOULOS, C
    COMPUTER LANGUAGES, 1978, 3 (03): : 145 - 155
  • [34] QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
    Yasunaga, Michihiro
    Ren, Hongyu
    Bosselut, Antoine
    Liang, Percy
    Leskovec, Jure
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 535 - 546
  • [35] JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
    Sun, Yueqing
    Shi, Qi
    Qi, Le
    Zhang, Yu
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5049 - 5060
  • [36] Multi-Agent Reinforcement Learning with Multi-Step Generative Models
    Krupnik, Orr
    Mordatch, Igor
    Tamar, Aviv
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [37] QAGCN: Answering Multi-relation Questions via Single-Step Implicit Reasoning over Knowledge Graphs
    Wang, Ruijie
    Rossetto, Luca
    Cochez, Michael
    Bernstein, Abraham
    SEMANTIC WEB, PT I, ESWC 2024, 2024, 14664 : 41 - 58
  • [38] Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
    Xi, Zhiheng
    Jin, Senjie
    Zhou, Yuhao
    Zheng, Rui
    Gao, Songyang
    Gu, Tao
    Zhang, Qi
    Huang, Xuanjing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11383 - 11406
  • [39] A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
    Stolfo, Alessandro
    Jin, Zhijing
    Shridhar, Kumar
    Scholkopf, Bernhard
    Sachan, Mrinmaya
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 545 - 561
  • [40] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
    Guo, Pei
    You, Wangjie
    Li, Juntao
    Yan, Bowen
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685