Generalizing Math Word Problem Solvers via Solution Diversification

被引:0
|
作者
Liang, Zhenwen [1 ]
Zhang, Jipeng [2 ]
Wang, Lei [3 ]
Wang, Yan [4 ]
Shao, Jie [5 ]
Zhang, Xiangliang [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Singapore Management Univ, Singapore, Singapore
[4] Tencent AI Lab, Shenzhen, Peoples R China
[5] Univ Elect Sci & Technol China, Chengdu, Peoples R China
关键词
MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current math word problem (MWP) solvers are usually Seq2Seq models trained by the (one-problem; one-solution) pairs, each of which is made of a problem description and a solution showing reasoning flow to get the correct answer. However, one MWP problem naturally has multiple solution equations. The training of an MWP solver with (one-problem; one-solution) pairs excludes other correct solutions, and thus limits the generalizability of the MWP solver. One feasible solution to this limitation is to augment multiple solutions to a given problem. However, it is difficult to collect diverse and accurate augment solutions through human efforts. In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator. The buffer includes solutions generated by an MWP solver to encourage the training data diversity. The discriminator controls the quality of buffered solutions to participate in training. Our framework is flexibly applicable to a wide setting of fully, semi-weakly and weakly supervised training for all Seq2Seq MWP solvers. We conduct extensive experiments on a benchmark dataset Math23k and a new dataset named Weak12k, and show that our framework improves the performance of various MWP solvers under different settings by generating correct and diverse solutions.
引用
收藏
页码:13183 / 13191
页数:9
相关论文
共 50 条
  • [1] An Introspective Data Augmentation Method for Training Math Word Problem Solvers
    Qin, Jinghui
    Huang, Zhongzhan
    Zeng, Ying
    Zhang, Quanshi
    Lin, Liang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3113 - 3127
  • [2] A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
    Miao, Shen-Yun
    Liang, Chao-Chun
    Su, Keh-Yih
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 975 - 984
  • [3] The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers
    Zhang, Dongxiang
    Wang, Lei
    Zhang, Luming
    Dai, Bing Tian
    Shen, Heng Tao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (09) : 2287 - 2305
  • [4] Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
    Liang, Zhenwen
    Yu, Wenhao
    Rajpurohit, Tanmay
    Clark, Peter
    Zhang, Xiangliang
    Kalyan, Ashwin
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 14384 - 14396
  • [5] Template-Based Math Word Problem Solvers with Recursive Neural Networks
    Wang, Lei
    Zhang, Dongxiang
    Zhang, Jipeng
    Xu, Xing
    Gao, Lianli
    Dai, Bing Tian
    Shen, Heng Tao
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7144 - 7151
  • [6] Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
    Kumar, Vivek
    Maheshwary, Rishabh
    Pudi, Vikram
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4194 - 4206
  • [7] Math Word Problem Generation via Disentangled Memory Retrieval
    Qin, Wei
    Wang, Xiaowei
    Hu, Zhenzhen
    Wang, Lei
    Lan, Yunshi
    Hong, Richang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)
  • [8] READING AND REASONING SKILLS FOR MATH PROBLEM SOLVERS
    THOMAS, DA
    JOURNAL OF READING, 1988, 32 (03): : 244 - 249
  • [9] MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers
    Lan, Yihuai
    Wang, Lei
    Zhang, Qiyuan
    Lan, Yunshi
    Dai, Bing Tian
    Wang, Yan
    Zhang, Dongxiang
    Lim, Ee-Peng
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 13188 - 13190
  • [10] SOLUTION OF MATH PROBLEM
    MAZUR, B
    SCIENCE, 1983, 222 (4623) : 456 - 456