Benchmarking question answering systems

被引:15
|
作者
Usbeck, Ricardo [1 ]
Roeder, Michael [1 ]
Hoffmann, Michael [3 ]
Conrads, Felix [1 ]
Huthmann, Jonathan [3 ]
Ngonga-Ngomo, Axel-Cyrille [1 ]
Demmler, Christian [3 ]
Unger, Christina [2 ]
机构
[1] Paderborn Univ, DICE Data Sci Grp, Paderborn, Germany
[2] Univ Bielefeld, CITEC, Bielefeld, Germany
[3] Univ Leipzig, AKSW Grp, Leipzig, Germany
基金
欧盟地平线“2020”;
关键词
Factoid question answering; benchmarking; repeatable open research;
D O I
10.3233/SW-180312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The necessity of making the Semantic Web more accessible for lay users, alongside the uptake of interactive systems and smart assistants for the Web, have spawned a new generation of RDF-based question answering systems. However, fair evaluation of these systems remains a challenge due to the different type of answers that they provide. Hence, repeating current published experiments or even benchmarking on the same datasets remains a complex and time-consuming task. We present a novel online benchmarking platform for question answering (QA) that relies on the FAIR principles to support the fine-grained evaluation of question answering systems. We detail how the platform addresses the fair benchmarking platform of question answering systems through the rewriting of URIs and URLs. In addition, we implement different evaluation metrics, measures, datasets and pre-implemented systems as well as methods to work with novel formats for interactive and non-interactive benchmarking of question answering systems. Our analysis of current frameworks shows that most of the current frameworks are tailored towards particular datasets and challenges but do not provide generic models. In addition, while most frameworks perform well in the annotation of entities and properties, the generation of SPARQL queries from annotated text remains a challenge.
引用
收藏
页码:293 / 304
页数:12
相关论文
共 50 条
  • [41] Arabic Question Answering Systems: Gap Analysis
    Biltawi, Mariam M.
    Tedmori, Sara
    Awajan, Arafat
    [J]. IEEE ACCESS, 2021, 9 : 63876 - 63904
  • [42] Exploiting Opinion Influence in Question Answering Systems
    Cercel, Dumitru-Clementin
    Onose, Cristian
    Trausan-Matu, Stefan
    Pop, Florin
    [J]. 2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 197 - 201
  • [43] A survey of consumer health question answering systems
    Welivita, Anuradha
    Pu, Pearl
    [J]. AI MAGAZINE, 2023, 44 (04) : 482 - 507
  • [44] A Quantitative Evaluation of Natural Language Question Interpretation for Question Answering Systems
    Asakura, Takuto
    Kim, Jin-Dong
    Yamamoto, Yasunori
    Tateisi, Yuka
    Takagi, Toshihisa
    [J]. SEMANTIC TECHNOLOGY (JIST 2018), 2018, 11341 : 215 - 231
  • [45] A Hybrid Approach for Question Classification in Persian Automatic Question Answering Systems
    Sherkat, Ehsan
    Farhoodi, Mojgan
    [J]. 2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 279 - 284
  • [46] Multilingual Question Answering Systems: Question Classification in Spanish based in Learning
    Garcia Cumbreras, Miguel Angel
    Martinez Santiago, Fernando
    Alfonso Urena Lopez, L.
    Montejo Raez, Arturo
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (34):
  • [47] Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics
    Deutsch, Daniel
    Roth, Dan
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3759 - 3765
  • [48] Importance of pronominal anaphora resolution in question answering systems
    Vicedo, JL
    Ferrández, A
    [J]. 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 555 - 562
  • [49] Knowledge trees and protoforms in question-answering systems
    Yager, RR
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (04): : 550 - 563
  • [50] Natural language asymmetries and the construction of question answering systems
    Di Sciullo, AM
    Aguero, C
    [J]. 7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, PROCEEDINGS: INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2003, : 13 - 18