Automatic question answering for multiple stakeholders, the epidemic question answering dataset

被引:0
|
作者
Travis R. Goodwin
Dina Demner-Fushman
Kyle Lo
Lucy Lu Wang
Hoa T. Dang
Ian M. Soboroff
机构
[1] National Library of Medicine,
[2] Allen Institute for AI,undefined
[3] National Institute of Standards and Technology,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Automatic Question Answering can quickly bring the most salient points to the user’s attention. Leveraging a collection of scientific articles, government websites, relevant news articles, curated social media posts, and questions asked by researchers, clinicians, and the general public, we developed a dataset to explore automatic Question Answering for multiple stakeholders. Analysis of questions asked by various stakeholders shows that while information needs of experts and the public may overlap, satisfactory answers to these questions often originate from different information sources or benefit from different approaches to answer generation. We believe that this dataset has the potential to support the development of question answering systems not only for epidemic questions, but for other domains with varying expertise such as legal or finance.
引用
收藏
相关论文
共 50 条
  • [31] Multiple answers to a question: a new approach for visual question answering
    Hosseinabad, Sayedshayan Hashemi
    Safayani, Mehran
    Mirzaei, Abdolreza
    [J]. VISUAL COMPUTER, 2021, 37 (01): : 119 - 131
  • [32] RuBQ 2.0: An Innovated Russian Question Answering Dataset
    Rybin, Ivan
    Korablinov, Vladislav
    Efimov, Pavel
    Braslavski, Pavel
    [J]. SEMANTIC WEB, ESWC 2021, 2021, 12731 : 532 - 547
  • [33] Building a benchmark dataset for the Kurdish news question answering
    Saeed, Ari M.
    [J]. DATA IN BRIEF, 2024, 57
  • [34] EgoVQA - An Egocentric Video Question Answering Benchmark Dataset
    Fan, Chenyou
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4359 - 4366
  • [35] OVQA: A Clinically Generated Visual Question Answering Dataset
    Huang, Yefan
    Wang, Xiaoli
    Liu, Feiyan
    Huang, Guofeng
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2924 - 2938
  • [36] A Large Visual Question Answering Dataset for Cultural Heritage
    Asprino, Luigi
    Bulla, Luana
    Marinucci, Ludovica
    Mongiovi, Misael
    Presutti, Valentina
    [J]. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE (LOD 2021), PT II, 2022, 13164 : 193 - 197
  • [37] A dataset for medical instructional video classification and question answering
    Gupta, Deepak
    Attal, Kush
    Demner-Fushman, Dina
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [38] TwEETQA: A Social Media Focused Question Answering Dataset
    Xiong, Wenhan
    Wu, Jiawei
    Wang, Hong
    Kulkarni, Vivek
    Yu, Mo
    Chang, Shiyu
    Guo, Xiaoxiao
    Wang, William Yang
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5020 - 5031
  • [39] ToolQA: A Dataset for LLM Question Answering with External Tools
    Zhuang, Yuchen
    Yu, Yue
    Wang, Kuan
    Sun, Haotian
    Zhang, Chao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [40] DAWQAS: A Dataset for Arabic Why Question Answering System
    Ismail, Walaa Saber
    Homsi, Masun Nabhan
    [J]. ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 123 - 131