Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

被引:0
|
作者
Schlegel, Viktor [1 ]
Nenadic, Goran [1 ]
Batista-Navarro, Riza [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, Manchester, Lancs, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advances in NLP have yielded impressive results for the task of machine reading comprehension (MRC), with approaches having been reported to achieve performance comparable to that of humans. In this paper, we investigate whether state-of-the-art MRC models are able to correctly process Semantics Altering Modifications (SAM): linguistically-motivated phenomena that alter the semantics of a sentence while preserving most of its lexical surface form. We present a method to automatically generate and align challenge sets featuring original and altered examples. We further propose a novel evaluation methodology to correctly assess the capability of MRC systems to process these examples independent of the data they were optimised on, by discounting for effects introduced by domain shift. In a large-scale empirical study, we apply the methodology in order to evaluate extractive MRC models with regard to their capability to correctly process SAM-enriched data. We comprehensively cover 12 different state-of-the-art neural architecture configurations and four training datasets and find that - despite their well-known remarkable performance - optimised models consistently struggle to correctly process semantically altered data.
引用
收藏
页码:13762 / 13770
页数:9
相关论文
共 50 条
  • [41] Pre-reading Activity over Question for Machine Reading Comprehension
    Yuan, Chenchen
    Liu, Kaiyang
    Zhang, Xulu
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1411 - 1418
  • [42] The effects of orthography, phonology, semantics, and working memory on the reading comprehension of children with and without reading dyslexia
    Ho, Jana Chi-san
    Reed, Deborah K.
    Mcbride, Catherine
    ANNALS OF DYSLEXIA, 2025,
  • [43] Conversational Machine Reading Comprehension for Vietnamese Healthcare Texts
    Luu, Son T.
    Bui, Mao Nguyen
    Nguyen, Loi Duc
    Tran, Khiem Vinh
    Nguyen, Kiet Van
    Nguyen, Ngan Luu-Thuy
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 1463 : 546 - 558
  • [44] A Framework for Evaluation of Machine Reading Comprehension Gold Standards
    Schlegel, Viktor
    Valentino, Marco
    Freitas, Andre
    Nenadic, Goran
    Batista-Navarro, Riza
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5359 - 5369
  • [45] Review of Conversational Machine Reading Comprehension for Knowledge Graph
    Hu, Juan
    Xi, Xuefeng
    Cui, Zhiming
    Computer Engineering and Applications, 2024, 60 (03) : 17 - 28
  • [46] A survey of deep learning techniques for machine reading comprehension
    Samreen Kazi
    Shakeel Khoja
    Ali Daud
    Artificial Intelligence Review, 2023, 56 : 2509 - 2569
  • [47] Fact -Driven Logical Reasoning for Machine Reading Comprehension
    Ouyang, Siru
    Zhang, Zhuosheng
    Zhao, Hai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18851 - 18859
  • [48] Dataset for the First Evaluation on Chinese Machine Reading Comprehension
    Cui, Yiming
    Liu, Ting
    Chen, Zhipeng
    Ma, Wentao
    Wang, Shijin
    Hu, Guoping
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2721 - 2725
  • [49] Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge
    Sun, Kai
    Yu, Dian
    Chen, Jianshu
    Yu, Dong
    Cardie, Claire
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8736 - 8747
  • [50] EVALUATING THE EFFICACY OF USING A DIGITAL READING ENVIRONMENT TO IMPROVE READING COMPREHENSION WITHIN A READING CLINIC
    Ortlieb, Evan
    Sargent, Stephan
    Moreland, Meagan
    READING PSYCHOLOGY, 2014, 35 (05) : 397 - 421