Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

被引:0
|
作者
Schlegel, Viktor [1 ]
Nenadic, Goran [1 ]
Batista-Navarro, Riza [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, Manchester, Lancs, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advances in NLP have yielded impressive results for the task of machine reading comprehension (MRC), with approaches having been reported to achieve performance comparable to that of humans. In this paper, we investigate whether state-of-the-art MRC models are able to correctly process Semantics Altering Modifications (SAM): linguistically-motivated phenomena that alter the semantics of a sentence while preserving most of its lexical surface form. We present a method to automatically generate and align challenge sets featuring original and altered examples. We further propose a novel evaluation methodology to correctly assess the capability of MRC systems to process these examples independent of the data they were optimised on, by discounting for effects introduced by domain shift. In a large-scale empirical study, we apply the methodology in order to evaluate extractive MRC models with regard to their capability to correctly process SAM-enriched data. We comprehensively cover 12 different state-of-the-art neural architecture configurations and four training datasets and find that - despite their well-known remarkable performance - optimised models consistently struggle to correctly process semantically altered data.
引用
收藏
页码:13762 / 13770
页数:9
相关论文
共 50 条
  • [1] Evaluating Machine Reading Systems through Comprehension Tests
    Penas, Anselmo
    Hovy, Eduard
    Forner, Pamela
    Rodrigo, Alvaro
    Sutcliffe, Richard
    Forascu, Corina
    Sporleder, Caroline
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1143 - 1147
  • [2] XCMRC: Evaluating Cross-Lingual Machine Reading Comprehension
    Liu, Pengyuan
    Deng, Yuning
    Zhu, Chenghao
    Hu, Han
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 552 - 564
  • [3] Comparing the effectiveness of reading modifications on comprehension accuracy and reading comprehension rate
    Bouck, Emily C.
    Truckenmiller, Adrea
    Bone, Erin
    Flanagan, Sara
    PREVENTING SCHOOL FAILURE, 2021, 65 (03): : 194 - 205
  • [4] Machine Comprehension with Syntax, Frames, and Semantics
    Wang, Hai
    Bansal, Mohit
    Gimpel, Kevin
    McAllester, David
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 700 - 706
  • [5] Survey on Machine Reading Comprehension
    Wang X.-J.
    Bai Z.-W.
    Li K.
    Yuan C.-X.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 1 - 9
  • [6] Event Extraction as Machine Reading Comprehension
    Liu, Jian
    Chen, Yubo
    Liu, Kang
    Bi, Wei
    Liu, Xiaojiang
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1641 - 1651
  • [7] A Survey on Machine Reading Comprehension Systems
    Baradaran, Razieh
    Ghiasi, Razieh
    Amirkhani, Hossein
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (06) : 683 - 732
  • [8] Improving Machine Reading Comprehension with General Reading Strategies
    Sun, Kai
    Yu, Dian
    Yu, Dong
    Cardie, Claire
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2633 - 2643
  • [9] Machine Reading Comprehension: Matching and Orders
    Liu, Ao
    Qu, Lizhen
    Lu, Junyu
    Zhang, Chenbin
    Xu, Zenglin
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2057 - 2060
  • [10] Retrospective Reader for Machine Reading Comprehension
    Zhang, Zhuosheng
    Yang, Junjie
    Zhao, Hai
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14506 - 14514