NLQuAD: A Non-Factoid Long Question Answering Data Set

被引:0
|
作者
Soleimani, Amir [1 ]
Monz, Christof [1 ]
Worring, Marcel [1 ]
机构
[1] Univ Amsterdam, Informat Inst, Amsterdam, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce NLQuAD, the first data set with baseline methods for non-factoid long question answering, a task requiring document-level language understanding. In contrast to existing span detection question answering data sets, NLQuAD has non-factoid questions that are not answerable by a short span of text and demanding multiple-sentence descriptive answers and opinions. We show the limitation of the F1 score for evaluation of long answers and introduce Intersection over Union (IoU), which measures position-sensitive overlap between the predicted and the target answer spans. To establish baseline performances, we compare BERT, RoBERTa, and Longformer models. Experimental results and human evaluations show that Longformer outperforms the other architectures, but results are still far behind a human upper bound, leaving substantial room for improvements. NLQuAD's samples exceed the input limitation of most pretrained Transformer-based models, encouraging future research on long sequence language models.(1)
引用
收藏
页码:1245 / 1255
页数:11
相关论文
共 50 条
  • [1] A Non-Factoid Question-Answering Taxonomy
    Bolotova, Valeriia
    Blinov, Vladislav
    Scholer, Falk
    Croft, W. Bruce
    Sanderson, Mark
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1196 - 1207
  • [2] Performance Prediction for Non-Factoid Question Answering
    Hashemi, Helia
    Zamani, Hamed
    Croft, W. Bruce
    [J]. PROCEEDINGS OF THE 2019 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'19), 2019, : 55 - 58
  • [3] Exploring Diversification in Non-factoid Question Answering
    Vikraman, Lakshmi
    Croft, W. Bruce
    O'Connor, Brendan
    [J]. PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 223 - 226
  • [4] Non-factoid Question Answering in the Legal Domain
    McElvain, Gayle
    Sanchez, George
    Teo, Don
    Custis, Tonya
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1395 - 1396
  • [5] Open-Domain Non-factoid Question Answering
    Khvalchik, Maria
    Kulkarni, Anagha
    [J]. TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 290 - 298
  • [6] Answer Interaction in Non-factoid Question Answering Systems
    Qu, Chen
    Yang, Liu
    Croft, W. Bruce
    Scholer, Falk
    Zhang, Yongfeng
    [J]. PROCEEDINGS OF THE 2019 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL (CHIIR'19), 2019, : 249 - 253
  • [7] A systematic review of question answering systems for non-factoid questions
    Eduardo Gabriel Cortes
    Vinicius Woloszyn
    Dante Barone
    Sebastian Möller
    Renata Vieira
    [J]. Journal of Intelligent Information Systems, 2022, 58 : 453 - 480
  • [8] A systematic review of question answering systems for non-factoid questions
    Cortes, Eduardo Gabriel
    Woloszyn, Vinicius
    Barone, Dante
    Moeller, Sebastian
    Vieira, Renata
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 58 (03) : 453 - 480
  • [9] A non-factoid question answering system for prior art search
    Zihayat, Morteza
    Etwaroo, Rochelle
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 177
  • [10] A SENTIMENT BASED NON-FACTOID QUESTION-ANSWERING FRAMEWORK
    Ye, Qiaofei
    Misra, Kanishka
    Devarapalli, Hemanth
    Rayz, Julia Taylor
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 372 - 377