BioASQ-QA: A manually curated corpus for Biomedical Question Answering

被引:4
|
作者
Krithara, Anastasia [1 ]
Nentidis, Anastasios [1 ,2 ]
Bougiatiotis, Konstantinos [1 ]
Paliouras, Georgios [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens, Greece
[2] Aristotle Univ Thessaloniki, Sch Informat, Thessaloniki, Greece
关键词
D O I
10.1038/s41597-023-02068-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The BioASQ question answering (QA) benchmark dataset contains questions in English, along with golden standard (reference) answers and related material. The dataset has been designed to reflect real information needs of biomedical experts and is therefore more realistic and challenging than most existing datasets. Furthermore, unlike most previous QA benchmarks that contain only exact answers, the BioASQ-QA dataset also includes ideal answers (in effect summaries), which are particularly useful for research on multi-document summarization. The dataset combines structured and unstructured data. The materials linked with each question comprise documents and snippets, which are useful for Information Retrieval and Passage Retrieval experiments, as well as concepts that are useful in concept-to-text Natural Language Generation. Researchers working on paraphrasing and textual entailment can also measure the degree to which their methods improve the performance of biomedical QA systems. Last but not least, the dataset is continuously extended, as the BioASQ challenge is running and new data are generated.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] BioASQ-QA: A manually curated corpus for Biomedical Question Answering
    Anastasia Krithara
    Anastasios Nentidis
    Konstantinos Bougiatiotis
    Georgios Paliouras
    [J]. Scientific Data, 10
  • [2] BioASQ: A Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering
    Balikas, Georgios
    Krithara, Anastasia
    Partalas, Ioannis
    Paliouras, George
    [J]. MULTIMODAL RETRIEVAL IN THE MEDICAL DOMAIN, MRMD 2015, 2015, 9059 : 26 - 39
  • [3] Yes/No Question Answering in BioASQ 2019
    Dimitriadis, Dimitris
    Tsoumakas, Grigorios
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 661 - 669
  • [4] Overview of BioASQ 2022: The Tenth BioASQ Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering
    Nentidis, Anastasios
    Katsimpras, Georgios
    Vandorou, Eirini
    Krithara, Anastasia
    Miranda-Escalada, Antonio
    Gasco, Luis
    Krallinger, Martin
    Paliouras, Georgios
    [J]. EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION (CLEF 2022), 2022, 13390 : 337 - 361
  • [5] An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition
    Tsatsaronis, George
    Balikas, Georgios
    Malakasiotis, Prodromos
    Partalas, Ioannis
    Zschunke, Matthias
    Alvers, Michael R.
    Weissenborn, Dirk
    Krithara, Anastasia
    Petridis, Sergios
    Polychronopoulos, Dimitris
    Almirantis, Yannis
    Pavlopoulos, John
    Baskiotis, Nicolas
    Gallinari, Patrick
    Artieres, Thierry
    Ngomo, Axel-Cyrille Ngonga
    Heino, Norman
    Gaussier, Eric
    Barrio-Alvers, Liliana
    Schroeder, Michael
    Androutsopoulos, Ion
    Paliouras, Georgios
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [6] An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition
    George Tsatsaronis
    Georgios Balikas
    Prodromos Malakasiotis
    Ioannis Partalas
    Matthias Zschunke
    Michael R Alvers
    Dirk Weissenborn
    Anastasia Krithara
    Sergios Petridis
    Dimitris Polychronopoulos
    Yannis Almirantis
    John Pavlopoulos
    Nicolas Baskiotis
    Patrick Gallinari
    Thierry Artiéres
    Axel-Cyrille Ngonga Ngomo
    Norman Heino
    Eric Gaussier
    Liliana Barrio-Alvers
    Michael Schroeder
    Ion Androutsopoulos
    Georgios Paliouras
    [J]. BMC Bioinformatics, 16
  • [7] Transformer Models for Question Answering at BioASQ 2019
    Resta, Michele
    Arioli, Daniele
    Fagnani, Alessandro
    Attardi, Giuseppe
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 711 - 726
  • [8] A Mixed Information Source Approach for Biomedical Question Answering: MindLab at BioASQ 7B
    Pineda-Vargas, Monica
    Rosso-Mateus, Andres
    Gonzalez, Fabio A.
    Montes-y-Gomez, Manuel
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 595 - 606
  • [9] BioASQ at CLEF2022: The Tenth Edition of the Large-scale Biomedical Semantic Indexing and Question Answering Challenge
    Nentidis, Anastasios
    Krithara, Anastasia
    Paliouras, Georgios
    Gasco, Luis
    Krallinger, Martin
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 : 429 - 435
  • [10] MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset
    Li, Jing
    Zhong, Shangping
    Chen, Kaizhi
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8862 - 8874