Improving the Robustness of Question Answering Systems to Question Paraphrasing

被引:0
|
作者
Gan, Wee Chung [1 ]
Ng, Hwee Tou [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the advancement of question answering (QA) systems and rapid improvements on held-out test sets, their generalizability is a topic of concern. We explore the robustness of QA models to question paraphrasing by creating two test sets consisting of paraphrased SQuAD questions. Paraphrased questions from the first test set are very similar to the original questions designed to test QA models' over-sensitivity, while questions from the second test set are paraphrased using context words near an incorrect answer candidate in an attempt to confuse QA models. We show that both paraphrased test sets lead to significant decrease in performance on multiple state-of-the-art QA models. Using a neural paraphrasing model trained to generate multiple paraphrased questions for a given source question and a set of paraphrase suggestions, we propose a data augmentation approach that requires no human intervention to re-train the models for improved robustness to question paraphrasing.
引用
收藏
页码:6065 / 6075
页数:11
相关论文
共 50 条
  • [1] Effects of structural matching and paraphrasing in question answering
    Takahashi, T
    Nawata, K
    Inui, K
    Matsumoto, Y
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (09): : 1677 - 1685
  • [2] Improving the robustness to recognition errors in speech input question answering
    Tsutsui, Hideki
    Manabe, Toshihiko
    Fukui, Mika
    Sakai, Tetsuya
    Fujii, Hiroko
    Urata, Koji
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 297 - 312
  • [3] Clean and Learn: Improving Robustness to Spurious Solutions in API Question Answering
    Yuan, Shuai
    Qin, Haozhe
    Gu, Xiaodong
    Shen, Beijun
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2022, 32 (07) : 1101 - 1123
  • [4] Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
    Bartolo, Max
    Thrush, Tristan
    Jia, Robin
    Riedel, Sebastian
    Stenetorp, Pontus
    Kiela, Douwe
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8830 - 8848
  • [5] Question Answering of Bar Exams by Paraphrasing and Legal Text Analysis
    Kim, Mi-Young
    Xu, Ying
    Lu, Yao
    Goebel, Randy
    [J]. NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2017, 10247 : 299 - 313
  • [6] Question/Answering Systems
    Visser, Ubbo
    [J]. KUNSTLICHE INTELLIGENZ, 2012, 26 (02): : 191 - 195
  • [7] QUESTION ANSWERING SYSTEMS
    Tomljanovic, Jasminka
    Krsnik, Marina
    Pavlic, Mile
    [J]. ZBORNIK VELEUCILISTA U RIJECI-JOURNAL OF THE POLYTECHNICS OF RIJEKA, 2014, 2 (01): : 177 - 195
  • [8] Question Classification for Arabic Question Answering Systems
    Al Chalabi, Hani Maluf
    Ray, Santosh Kumar
    Shaalan, Khaled
    [J]. 2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH (ICTRC), 2015, : 310 - 313
  • [9] User Feedback for Improving Question Categorization in Web-Based Question Answering Systems
    Song, Wanpeng
    Liu Wenyin
    Gu, Naijie
    Quan, Xiaojun
    [J]. ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 148 - +
  • [10] Knowledge-based Question Answering by Jointly Generating, Copying and Paraphrasing
    Zhu, Shuguang
    Cheng, Xiang
    Su, Sen
    Lang, Shuang
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2439 - 2442