Learning Multi-Step Reasoning by Solving Arithmetic Tasks

被引:0
|
作者
Wang, Tianduo [1 ]
Lu, Wei [1 ]
机构
[1] Singapore Univ Technol & Design, StatNLP Res Grp, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning abilities, i.e., the ability to decompose complex questions into step-by-step reasoning chains, but such ability seems only to emerge from models with abundant parameters. This work investigates how to incorporate relatively small LMs with the capabilities of multi-step reasoning. We propose to inject such abilities by continually pre-training LMs on a synthetic dataset MSAT which is composed of Multi-step Arithmetic Tasks. Our experiments on four math word problem datasets show the effectiveness of the proposed method in enhancing LMs' math reasoning abilities.(1)
引用
下载
收藏
页码:1229 / 1238
页数:10
相关论文
共 50 条
  • [31] Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval
    Zhao, Chen
    Xiong, Chenyan
    Boyd-Graber, Jordan
    Daume, Hal, III
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4635 - 4641
  • [32] Multi-step Forecasting via Multi-task Learning
    Jawed, Shayan
    Rashed, Ahmed
    Schmidt-Thieme, Lars
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 790 - 799
  • [33] Probing Cross-Modal Representations in Multi-Step Relational Reasoning
    Parfenova, Iuliia
    Elliott, Desmond
    Fernandez, Raquel
    Pezzelle, Sandro
    REPL4NLP 2021: PROCEEDINGS OF THE 6TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2021, : 152 - 162
  • [34] Efficient Multi-step Reasoning Attention Network for Visual Question Answering
    Zhang, Haotian
    Wu, Wei
    Zhang, Meng
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [35] Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
    Gan, Zhe
    Cheng, Yu
    El Kholy, Ahmed
    Li, Linjie
    Liu, Jingjing
    Gao, Jianfeng
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6463 - 6474
  • [36] Intention Modulation for Multi-step Tasks in Continuous Time Active Inference
    Priorelli, Matteo
    Stoianov, Ivilin Peev
    ACTIVE INFERENCE, IWAI 2022, 2023, 1721 : 274 - 284
  • [37] Multi-step Prediction for Learning Invariant Representations in Reinforcement Learning
    Xu, Xinyue
    Lv, Kai
    Dong, Xingye
    Han, Sheng
    Lin, Youfang
    2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2021, : 202 - 206
  • [38] Multi-step root solvers of Traub's type in real interval arithmetic
    Petkovic, Miodrag S.
    APPLIED MATHEMATICS AND COMPUTATION, 2014, 248 : 430 - 440
  • [39] Transfer Learning for Multi-Step Resource Utilization Prediction
    Parera, Claudia
    Liao, Qi
    Malanchini, Ilaria
    Wellington, Dan
    Redondi, Alessandro E. C.
    Cesana, Matteo
    2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,
  • [40] Analysis of multi-step algorithms for cognitive maps learning
    Jastriebow, A.
    Poczeta, K.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2014, 62 (04) : 735 - 741