Generalizing Math Word Problem Solvers via Solution Diversification

被引:0
|
作者
Liang, Zhenwen [1 ]
Zhang, Jipeng [2 ]
Wang, Lei [3 ]
Wang, Yan [4 ]
Shao, Jie [5 ]
Zhang, Xiangliang [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Singapore Management Univ, Singapore, Singapore
[4] Tencent AI Lab, Shenzhen, Peoples R China
[5] Univ Elect Sci & Technol China, Chengdu, Peoples R China
关键词
MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current math word problem (MWP) solvers are usually Seq2Seq models trained by the (one-problem; one-solution) pairs, each of which is made of a problem description and a solution showing reasoning flow to get the correct answer. However, one MWP problem naturally has multiple solution equations. The training of an MWP solver with (one-problem; one-solution) pairs excludes other correct solutions, and thus limits the generalizability of the MWP solver. One feasible solution to this limitation is to augment multiple solutions to a given problem. However, it is difficult to collect diverse and accurate augment solutions through human efforts. In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator. The buffer includes solutions generated by an MWP solver to encourage the training data diversity. The discriminator controls the quality of buffered solutions to participate in training. Our framework is flexibly applicable to a wide setting of fully, semi-weakly and weakly supervised training for all Seq2Seq MWP solvers. We conduct extensive experiments on a benchmark dataset Math23k and a new dataset named Weak12k, and show that our framework improves the performance of various MWP solvers under different settings by generating correct and diverse solutions.
引用
收藏
页码:13183 / 13191
页数:9
相关论文
共 50 条
  • [41] Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving
    Qin, Jinghui
    Yang, Zhicheng
    Chen, Jiaqi
    Liang, Xiaodan
    Lin, Liang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12823 - 12835
  • [42] HGEN: Learning Hierarchical Heterogeneous Graph Encoding for Math Word Problem Solving
    Zhang, Yi
    Zhou, Guangyou
    Xie, Zhiwen
    Huang, Jimmy Xiangji
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 816 - 828
  • [43] Teacher-Student Networks with Multiple Decoders for Solving Math Word Problem
    Zhang, Jipeng
    Lee, Roy Ka-Wei
    Lim, Ee-Peng
    Qin, Wei
    Wang, Lei
    Shao, Jie
    Sun, Qianru
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4011 - 4017
  • [44] The Effects of Dynamic Strategic Math on English Language Learners' Word Problem Solving
    Orosco, Michael J.
    Swanson, H. Lee
    O'Connor, Rollanda
    Lussier, Cathy
    JOURNAL OF SPECIAL EDUCATION, 2013, 47 (02): : 96 - 107
  • [45] Recycling Numeracy Data Augmentation with Symbolic Verification for Math Word Problem Solving
    Jen, Tien-Yi
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 653 - 657
  • [46] Non-Autoregressive Math Word Problem Solver with Unified Tree Structure
    Bin, Yi
    Han, Mengqun
    Shi, Wenhao
    Wang, Lei
    Yang, Yang
    Ng, See-Kiong
    Shen, Heng Tao
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3290 - 3301
  • [47] The Context-Oriented System Based on ELECTRA for Solving Math Word Problem
    Meng, Hao
    Wu, Hao
    Yu, Xinguo
    IEEE TALE2021: IEEE INTERNATIONAL CONFERENCE ON ENGINEERING, TECHNOLOGY AND EDUCATION, 2021, : 976 - 981
  • [48] Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction
    Jie, Zhanming
    Li, Jierui
    Lu, Wei
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5944 - 5955
  • [49] Word Problem Strategy for Latino English Language Learners at Risk for Math Disabilities
    Orosco, Michael J.
    LEARNING DISABILITY QUARTERLY, 2014, 37 (01) : 45 - 53
  • [50] Math Word Problem Solver Based on Text-to-Text Transformer Model
    Yang, Chuanzhi
    Huang, Runze
    Yu, Xinguo
    Peng, Rao
    IEEE TALE2021: IEEE INTERNATIONAL CONFERENCE ON ENGINEERING, TECHNOLOGY AND EDUCATION, 2021, : 818 - 822