Learning Multi-Step Reasoning by Solving Arithmetic Tasks

被引:0
|
作者
Wang, Tianduo [1 ]
Lu, Wei [1 ]
机构
[1] Singapore Univ Technol & Design, StatNLP Res Grp, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning abilities, i.e., the ability to decompose complex questions into step-by-step reasoning chains, but such ability seems only to emerge from models with abundant parameters. This work investigates how to incorporate relatively small LMs with the capabilities of multi-step reasoning. We propose to inject such abilities by continually pre-training LMs on a synthetic dataset MSAT which is composed of Multi-step Arithmetic Tasks. Our experiments on four math word problem datasets show the effectiveness of the proposed method in enhancing LMs' math reasoning abilities.(1)
引用
收藏
页码:1229 / 1238
页数:10
相关论文
共 50 条
  • [1] Multi-Step Reasoning for IoT Devices
    Miguel Blanco, Jose
    Rossi, Bruno
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2023, 2023, : 330 - 337
  • [2] Multi-Step Algorithms for Solving EPs
    Pham Ngoc Anh
    Dang Van Hieu
    [J]. MATHEMATICAL MODELLING AND ANALYSIS, 2018, 23 (03) : 453 - 472
  • [3] A novel multi-step reinforcement learning method for solving reward hacking
    Yuan, Yinlong
    Yu, Zhu Liang
    Gu, Zhenghui
    Deng, Xiaoyan
    Li, Yuanqing
    [J]. APPLIED INTELLIGENCE, 2019, 49 (08) : 2874 - 2888
  • [4] A novel multi-step reinforcement learning method for solving reward hacking
    Yinlong Yuan
    Zhu Liang Yu
    Zhenghui Gu
    Xiaoyan Deng
    Yuanqing Li
    [J]. Applied Intelligence, 2019, 49 : 2874 - 2888
  • [5] Multi-Step Inference for Reasoning Over Paragraphs
    Liu, Jiangming
    Gardner, Matt
    Cohen, Shay B.
    Lapata, Mirella
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3040 - 3050
  • [6] Conditional Visual Servoing for Multi-Step Tasks
    Izquierdo, Sergio
    Argus, Max
    Brox, Thomas
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2190 - 2196
  • [7] Learn multi-step object sorting tasks through deep reinforcement learning
    Bao, Jiatong
    Zhang, Guoqing
    Peng, Yi
    Shao, Zhiyu
    Song, Aiguo
    [J]. ROBOTICA, 2022, 40 (11) : 3878 - 3894
  • [8] Logic Learning From Demonstrations for Multi-Step Manipulation Tasks in Dynamic Environments
    Zhang, Yan
    Xue, Teng
    Razmjoo, Amirreza
    Calinon, Sylvain
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7214 - 7221
  • [9] Explore Multi-Step Reasoning in Video Question Answering
    Han, Yahong
    [J]. PROCEEDINGS OF THE 1ST WORKSHOP AND CHALLENGE ON COMPREHENSIVE VIDEO UNDERSTANDING IN THE WILD (COVIEW'18), 2018, : 5 - 5
  • [10] Explore Multi-Step Reasoning in Video Question Answering
    Song, Xiaomeng
    Shi, Yucheng
    Chen, Xin
    Han, Yahong
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 239 - 247