Semantic-Based Data Augmentation for Math Word Problems

被引:2
|
作者
Li, Ailisi [1 ]
Xiao, Yanghua [1 ,2 ]
Liang, Jiaqing [1 ]
Chen, Yunwen [3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Data Sci, Shanghai, Peoples R China
[2] Fudan Aishu Cognit Intelligence Joint Res Ctr, Shanghai, Peoples R China
[3] DataGrand Inc, Shanghai, Peoples R China
关键词
Math word problem; Data augmentation;
D O I
10.1007/978-3-031-00129-1_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It's hard for neural MWP solvers to deal with tiny local variances. In MWP task, some local changes conserve the original semantic while the others may totally change the underlying logic. Currently, existing datasets for MWP task contain limited samples which are key for neural models to learn to disambiguate different kinds of local variances in questions and solve the questions correctly. In this paper, we propose a set of novel data augmentation approaches to supplement existing datasets with such data that are augmented with different kinds of local variances, and help to improve the generalization ability of current neural models. New samples are generated by knowledge guided entity replacement, and logic guided problem reorganization. The augmentation approaches are ensured to keep the consistency between the new data and their labels. Experimental results have shown the necessity and the effectiveness of our methods.
引用
收藏
页码:36 / 51
页数:16
相关论文
共 50 条
  • [31] SEiS: A semantic-based system for integrating building energy data
    Madrazo, L.
    Massetti, M.
    Sicilia, A.
    Wadel, G.
    Ianni, M.
    INFORMES DE LA CONSTRUCCION, 2015, 67 (537)
  • [32] SHRDIS: A Semantic-based Heterogeneous Relational Data Integration System
    Wang, Jinpeng
    Zhang, Yafei
    Lu, Jianjiang
    Miao, Zhuang
    NANOTECHNOLOGY AND COMPUTER ENGINEERING, 2010, 121-122 : 335 - 340
  • [33] MediGrid - Facilitating Semantic-Based processing of Biomedical Data and Knowledge
    Vejvalka, Jan
    Lesny, Petr
    Holecek, Tomas
    Slaby, Krystof
    Jarolimkova, Adela
    Bouzkova, Helena
    OPEN SOURCE IN EUROPEAN HEALTH CARE: THE TIME IS RIPE, 2009, : 18 - +
  • [34] A Semantic-Based Approach for Managing Healthcare Big Data: A Survey
    Hammad, Rafat
    Barhoush, Malek
    Abed-alguni, Bilal H.
    JOURNAL OF HEALTHCARE ENGINEERING, 2020, 2020
  • [35] A Generation-based Deductive Method for Math Word Problems
    Hu, Yuxuan
    Zhang, Jing
    Li, Haoyang
    Li, Cuiping
    Chen, Hong
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 1737 - 1750
  • [36] Semantic-Based Test Oracles
    Bai, Xiaoying
    Hou, Kejia
    Lu, Hao
    Zhang, Yao
    Hu, Linping
    Ye, Hong
    2011 35TH IEEE ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2011, : 640 - 649
  • [37] Semantic-Based Process Analysis
    Di Francescomarino, Chiara
    Corcoglioniti, Francesco
    Dragoni, Mauro
    Bertoli, Piergiorgio
    Tiella, Roberto
    Ghidini, Chiara
    Nori, Michele
    Pistore, Marco
    SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797 : 228 - 243
  • [38] Math Word Problems: Reading Math Situations From the Start
    Sherman, Khristine
    Gabriel, Rachael
    READING TEACHER, 2017, 70 (04): : 473 - 477
  • [39] An Approach Towards Multilingual Translation By Semantic-Based Verb Identification And Root Word Analysis
    Anik, Md. Saidul Hoque
    Islam, Md. Adnanul
    Al Islam, A. B. M. Alim
    PROCEEDINGS OF 2018 5TH INTERNATIONAL CONFERENCE ON NETWORKING, SYSTEMS AND SECURITY (NSYSS), 2018, : 120 - 128
  • [40] Deep Visual Semantic Embedding with Text Data Augmentation and Word Embedding Initialization
    He, Hai
    Yang, Haibo
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021