Seeking Patterns, Not just Memorizing Procedures: Contrastive Learning for Solving Math Word Problems

Cited by: 0
Authors
Li, Zhongli [1 ]
Zhang, Wenxuan [2 ]
Yan, Chao [2 ]
Zhou, Qingyu [1 ]
Li, Chao [1 ]
Liu, Hongzhi [2 ]
Cao, Yunbo [1 ]
Affiliations
[1] Tencent Cloud Xiaowei, Shenzhen, Guangdong, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Math Word Problem (MWP) solving needs to discover the quantitative relationships over natural language narratives. Recent work shows that existing models memorize procedures from context and rely on shallow heuristics to solve MWPs. In this paper, we look at this issue and argue that the cause is a lack of overall understanding of MWP patterns. We first investigate how a neural network understands patterns only from semantics, and observe that, if the prototype equations like n_1 + n_2 are the same, most problems get closer representations and those representations apart from them or close to other prototypes tend to produce wrong solutions. Inspired by it, we propose a contrastive learning approach, where the neural network perceives the divergence of patterns. We collect contrastive examples by converting the prototype equation into a tree and seeking similar tree structures. The solving model is trained with an auxiliary objective on the collected examples, resulting in the representations of problems with similar prototypes being pulled closer. We conduct experiments on the Chinese dataset Math23k and the English dataset MathQA. Our method greatly improves the performance in monolingual and multilingual settings.
Pages: 2486 - 2496
Number of pages: 11
Related papers
  • [1] Learning by Fixing: Solving Math Word Problems with Weak Supervision
    Hong, Yining
    Li, Qing
    Ciao, Daniel
    Huang, Siyuan
    Zhu, Song-Chun
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4959 - 4967
  • [2] Graph-to-Tree Learning for Solving Math Word Problems
    Zhang, Jipeng
    Wang, Lei
    Lee, Roy Ka-Wei
    Bin, Yi
    Wang, Yan
    Shao, Jie
    Lim, Ee-Peng
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3928 - 3937
  • [3] Solving Math Word Problems with Teacher Supervision
    Liang, Zhenwen
    Zhang, Xiangliang
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3522 - 3528
  • [4] TEACHING STRATEGIES FOR SOLVING WORD-PROBLEMS IN MATH
    DUNLAP, WP
    MCKNIGHT, M
    [J]. ACADEMIC THERAPY, 1980, 15 (04): : 431 - 441
  • [5] ArMATH: a Dataset for Solving Arabic Math Word Problems
    Alghamdi, Reem
    Liang, Zhenwen
    Zhang, Xiangliang
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 351 - 362
  • [6] Goal selection and feedback for solving math word problems
    He, Daijun
    Xiao, Jing
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14744 - 14758
  • [7] Goal selection and feedback for solving math word problems
    Daijun He
    Jing Xiao
    [J]. Applied Intelligence, 2023, 53 : 14744 - 14758
  • [8] Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving
    Qin, Jinghui
    Yang, Zhicheng
    Chen, Jiaqi
    Liang, Xiaodan
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12823 - 12835
  • [9] Classifying and Solving Arithmetic Math Word Problems-An Intelligent Math Solver
    Mandal, Sourav
    Naskar, Sudip Kumar
    [J]. IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2021, 14 (01): : 28 - 41
  • [10] Towards Better Quantity Representations for Solving Math Word Problems
    Sun, Runxin
    He, Shizhu
    Zhao, Jun
    Liu, Kang
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (07)