Automatic question answering for multiple stakeholders, the epidemic question answering dataset

Cited by: 0
Authors
Travis R. Goodwin
Dina Demner-Fushman
Kyle Lo
Lucy Lu Wang
Hoa T. Dang
Ian M. Soboroff
Affiliations
[1] National Library of Medicine
[2] Allen Institute for AI
[3] National Institute of Standards and Technology
DOI: not available
Abstract
One of the effects of the COVID-19 pandemic is a rapidly growing and changing stream of publications that inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Automatic Question Answering can quickly bring the most salient points to the user’s attention. Leveraging a collection of scientific articles, government websites, relevant news articles, curated social media posts, and questions asked by researchers, clinicians, and the general public, we developed a dataset to explore automatic Question Answering for multiple stakeholders. Analysis of the questions asked by various stakeholders shows that, while the information needs of experts and the public may overlap, satisfactory answers to these questions often originate from different information sources or benefit from different approaches to answer generation. We believe that this dataset has the potential to support the development of question answering systems not only for epidemic questions, but also for other domains in which users have varying levels of expertise, such as law or finance.
Related papers
  • [1] Automatic question answering for multiple stakeholders, the epidemic question answering dataset
    Goodwin, Travis R.
    Demner-Fushman, Dina
    Lo, Kyle
    Wang, Lucy Lu
    Dang, Hoa T.
    Soboroff, Ian M.
    [J]. SCIENTIFIC DATA, 2022, 9 (01)
  • [2] Research on Question Classification for Automatic Question Answering
    Xu, Shihua
    Cheng, Gang
    Kong, Fang
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 218 - 221
  • [3] Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering
    Carrino, Casimiro Pio
    Costa-jussa, Marta R.
    Fonollosa, Jose A. R.
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5515 - 5523
  • [4] PQuAD: A Persian question answering dataset
    Darvishi, Kasra
    Shahbodaghkhan, Newsha
    Abbasiantaeb, Zahra
    Momtazi, Saeedeh
    [J]. COMPUTER SPEECH AND LANGUAGE, 2023, 80
  • [5] FQuAD: French Question Answering Dataset
    d'Hoffschmidt, Martin
    Belblidia, Wacim
    Heinrich, Quentin
    Brendle, Tom
    Vidal, Maxime
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1193 - 1208
  • [6] Slovak Dataset for Multilingual Question Answering
    Hladek, Daniel
    Stas, Jan
    Juhar, Jozef
    Koctur, Tomas
    [J]. IEEE ACCESS, 2023, 11 : 32869 - 32881
  • [7] VQuAnDa: Verbalization QUestion ANswering DAtaset
    Kacupaj, Endri
    Zafar, Hamid
    Lehmann, Jens
    Maleshkova, Maria
    [J]. SEMANTIC WEB (ESWC 2020), 2020, 12123 : 531 - 547
  • [8] LLQA - Lifelog Question Answering Dataset
    Tran, Ly-Duyen
    Thanh Cong Ho
    Lan Anh Pham
    Binh Nguyen
    Gurrin, Cathal
    Zhou, Liting
    [J]. MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 217 - 228
  • [9] Question and Answer Classification in Czech Question Answering Benchmark Dataset
    Kusnirakova, Dasa
    Medved, Marek
    Horak, Ales
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 701 - 706
  • [10] Automatic question answering: Beyond the factoid
    Soricut, R
    Brill, E
    [J]. HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 57 - 64