Evaluating Neural Model Robustness for Machine Comprehension

Authors
Wu, Winston [1]
Arendt, Dustin [2]
Volkova, Svitlana [3]
Affiliations
[1] Johns Hopkins Univ, Dept Comp Sci, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Pacific Northwest Natl Lab, Visual Analyt Grp, Richland, WA USA
[3] Pacific Northwest Natl Lab, Data Sci & Analyt Grp, Richland, WA USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We evaluate neural model robustness to adversarial attacks using different types of linguistic unit perturbations (character and word), and propose a new method for strategic sentence-level perturbations. We experiment with different amounts of perturbations to examine model confidence and misclassification rate, and contrast model performance with different embeddings (BERT and ELMo) on two benchmark datasets (SQuAD and TriviaQA). We demonstrate how to improve model performance during an adversarial attack by using ensembles. Finally, we analyze factors that affect model behavior under adversarial attack, and develop a new model to predict errors during attacks. Our novel findings reveal that (a) unlike BERT, models that use ELMo embeddings are more susceptible to adversarial attacks, (b) unlike word and paraphrase perturbations, character perturbations affect the model the most but are most easily compensated for by adversarial training, (c) word perturbations lead to more high-confidence misclassifications than sentence- and character-level perturbations, (d) the type of question and the model answer length (the longer the answer, the more likely it is to be incorrect) are the most predictive of model errors in adversarial settings, and (e) conclusions about model behavior are dataset-specific.
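The abstract does not specify how the perturbations are generated, so the following is only a minimal illustrative sketch of what character- and word-level perturbations at a controllable amount, and a misclassification rate, could look like. The function names, the adjacent-character swap, the word-dropping strategy, and the `rate` and `seed` parameters are all assumptions made for illustration, not the authors' method.

```python
import random


def perturb_characters(text, rate, seed=0):
    """Swap two adjacent characters inside a fraction `rate` of the words.

    Hypothetical character-level perturbation; the swap strategy is an assumption.
    """
    rng = random.Random(seed)
    words = text.split()
    # Only words with at least two characters can have an adjacent swap.
    candidates = [i for i, w in enumerate(words) if len(w) >= 2]
    n_perturb = round(rate * len(candidates))
    for i in rng.sample(candidates, n_perturb):
        chars = list(words[i])
        j = rng.randrange(len(chars) - 1)
        chars[j], chars[j + 1] = chars[j + 1], chars[j]
        words[i] = "".join(chars)
    return " ".join(words)


def perturb_words(text, rate, seed=0):
    """Drop a fraction `rate` of the words.

    Hypothetical word-level perturbation; dropping words is an assumption.
    """
    rng = random.Random(seed)
    words = text.split()
    n_drop = round(rate * len(words))
    dropped = set(rng.sample(range(len(words)), n_drop))
    return " ".join(w for i, w in enumerate(words) if i not in dropped)


def misclassification_rate(predictions, gold_answers):
    """Fraction of predictions that differ from the gold answers (exact match)."""
    wrong = sum(p != g for p, g in zip(predictions, gold_answers))
    return wrong / len(gold_answers)
```

Varying `rate` from 0 to 1 and re-scoring a model on the perturbed passages would give a curve of misclassification rate against perturbation amount, which is the kind of comparison the abstract describes.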
Pages: 2470-2481
Page count: 12
Related papers
50 in total
  • [1] Robustness-Eva-MRC: Assessing and analyzing the robustness of neural models in extractive machine reading comprehension
    Fang, Jingliang
    Xu, Hua
    Wu, Zhijing
    Gao, Kai
    Che, Xiaoyin
    Hui, Haotian
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [2] Robustness of Chinese Machine Reading Comprehension
    Li Y.
    Tang H.
    Qian J.
    Zou B.
    Hong Y.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2021, 57 (01): : 16 - 22
  • [3] Benchmarking Robustness of Machine Reading Comprehension Models
    Si, Chenglei
    Yang, Ziqing
    Cui, Yiming
    Ma, Wentao
    Liu, Ting
    Wang, Shijin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 634 - 644
  • [4] Design of A Recurrent Neural Network Model for Machine Reading Comprehension
    Singh, Uttam
    Kedas, Shweta
    Prasanth, Sikakollu
    Kumar, Arun
    Semwal, Vijay Bhaskar
    Tikkiwal, Vinay Anand
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1791 - 1800
  • [5] Improving the robustness of machine reading comprehension model with hierarchical knowledge and auxiliary unanswerability prediction
    Wu, Zhijing
    Xu, Hua
    KNOWLEDGE-BASED SYSTEMS, 2020, 203
  • [6] Towards Evaluating the Robustness of Neural Networks
    Carlini, Nicholas
    Wagner, David
    2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, : 39 - 57
  • [7] DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
    Tang, Hongxuan
    Li, Hongyu
    Liu, Jing
    Hong, Yu
    Wu, Hua
    Wang, Haifeng
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 955 - 963
  • [8] Interpreting and Evaluating Neural Network Robustness
    Yu, Fuxun
    Qin, Zhuwei
    Liu, Chenchen
    Zhao, Liang
    Wang, Yanzhi
    Chen, Xiang
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4199 - 4205
  • [9] The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models
    Son Quoc Tran
    Phong Nguyen-Thuan Do
    Le, Uyen
    Kretchmar, Matt
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1543 - 1557
  • [10] Improving the robustness of machine reading comprehension via contrastive learning
    Feng, Jianzhou
    Sun, Jiawei
    Shao, Di
    Cui, Jinman
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9103 - 9114