The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge

被引:4
|
作者
Auer, Soeren [1 ,2 ]
Barone, Dante A. C. [3 ]
Bartz, Cassiano [3 ]
Cortes, Eduardo G. [3 ]
Jaradeh, Mohamad Yaser [1 ,2 ]
Karras, Oliver [1 ]
Koubarakis, Manolis [4 ]
Mouromtsev, Dmitry [5 ]
Pliukhin, Dmitrii [5 ]
Radyush, Daniil [5 ]
Shilin, Ivan [5 ]
Stocker, Markus [1 ,2 ]
Tsalapati, Eleni [4 ]
机构
[1] TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany
[2] Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany
[3] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, Brazil
[4] Natl & Kapodistrian Univ Athens, Dept Informat & Telecommun, Athens, Greece
[5] ITMO Univ, Lab Informat Sci & Semant Technol, St Petersburg, Russia
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
D O I
10.1038/s41598-023-33607-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Knowledge graphs have gained increasing popularity in the last decade in science and technology. However, knowledge graphs are currently relatively simple to moderate semantic structures that are mainly a collection of factual statements. Question answering (QA) benchmarks and systems were so far mainly geared towards encyclopedic knowledge graphs such as DBpedia and Wikidata. We present SciQA a scientific QA benchmark for scholarly knowledge. The benchmark leverages the Open Research Knowledge Graph (ORKG) which includes almost 170,000 resources describing research contributions of almost 15,000 scholarly articles from 709 research fields. Following a bottom-up methodology, we first manually developed a set of 100 complex questions that can be answered using this knowledge graph. Furthermore, we devised eight question templates with which we automatically generated further 2465 questions, that can also be answered with the ORKG. The questions cover a range of research fields and question types and are translated into corresponding SPARQL queries over the ORKG. Based on two preliminary evaluations, we show that the resulting SciQA benchmark represents a challenging task for next-generation QA systems. This task is part of the open competitions at the 22nd International Semantic Web Conference 2023 as the Scholarly Question Answering over Linked Data (QALD) Challenge.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] BBQ: A Hand-Built Bias Benchmark for Question Answering
    Parrish, Alicia
    Chen, Angelica
    Nangia, Nikita
    Padmakumar, Vishakh
    Phang, Jason
    Thompson, Jana
    Phu Mon Htut
    Bowman, Samuel R.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2086 - 2105
  • [32] SelQA: A New Benchmark for Selection-based Question Answering
    Jurczyk, Tomasz
    Zhai, Michael
    Choi, Jinho D.
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 820 - 827
  • [33] Knowledge Graph Based Question Routing for Community Question Answering
    Liu, Zhu
    Li, Kan
    Qu, Dacheng
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 721 - 730
  • [34] Knowledge Base Question Answering With Attentive Pooling for Question Representation
    Wang, Run-Ze
    Ling, Zhen-Hua
    Hu, Yu
    [J]. IEEE ACCESS, 2019, 7 : 46773 - 46784
  • [35] Knowledge and reasoning for question answering: Research perspectives
    Saint-Dizier, Patrick
    Moens, Marie-Francine
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (06) : 899 - 906
  • [36] Joint Knowledge Graph Completion and Question Answering
    Liu, Lihui
    Du, Boxin
    Xu, Jiejun
    Xia, Yinglong
    Tong, Hanghang
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1098 - 1108
  • [37] QUESTION-ANSWERING STRATEGIES AND CONCEPTUAL KNOWLEDGE
    SINGER, M
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1991, 29 (02) : 143 - 146
  • [38] Research on the method of knowledge base question answering
    Jin, Tao
    Wang, Hai-Jun
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 527 - 530
  • [39] A Survey: Complex Knowledge Base Question Answering
    Luo, Yuxin
    Yang, Bailong
    Xu, Donghui
    Tian, Luogeng
    [J]. 2022 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2022), 2022, : 46 - 52
  • [40] Complex Knowledge Base Question Answering: A Survey
    Lan, Yunshi
    He, Gaole
    Jiang, Jinhao
    Jiang, Jing
    Zhao, Wayne Xin
    Wen, Ji-Rong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11196 - 11215