A study about the future evaluation of Question-Answering systems

Cited by: 22
Authors
Rodrigo, Alvaro [1 ]
Penas, Anselmo [1 ]
Affiliations
[1] UNED, NLP & IR Grp, Juan Rosal 16, Madrid, Spain
Keywords
Question Answering; Evaluation campaigns; Validation; Textual inference; TESTS;
DOI
10.1016/j.knosys.2017.09.015
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Evaluation campaigns for Question Answering (QA) systems have contributed to the development of these technologies. These campaigns have promoted some changes aimed at improving results. However, systems now appear to have reached an upper bound, and they are still far from answering complex questions. In this paper, we overview the main QA evaluations over free text, paying special attention to the changes encouraged in these campaigns. We observe that systems still return a high proportion of incorrect answers and that the changes have barely been incorporated into traditional approaches. Moreover, we analyze QA collections to gain better insight into the main challenges for current QA systems. We find that QA systems have great difficulty dealing with different rewordings in questions and documents, as well as inferring information that is not explicitly mentioned in texts. Based on these observations, we recommend a set of directions for future evaluations, suggesting the application of textual inference and knowledge bases as a way of improving results. (C) 2017 Elsevier B.V. All rights reserved.
Pages: 83-93
Page count: 11
Related papers
50 in total
  • [1] A Framework of Evaluation for Question-Answering Systems
    El Ayari, Sarra
    Grau, Brigitte
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 744 - 748
  • [2] Question-Answering Systems: Development and Prospects
    Lapshin, V. A.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2012, 46 (03) : 138 - 145
  • [3] A survey on legal question-answering systems
    Martinez-Gil, Jorge
    [J]. COMPUTER SCIENCE REVIEW, 2023, 48
  • [4] Question-answering systems: Development and prospects
    V. A. Lapshin
    [J]. Automatic Documentation and Mathematical Linguistics, 2012, 46 (3) : 138 - 145
  • [5] Question-answering systems in knowledge management
    Moldovan, D
    [J]. IEEE INTELLIGENT SYSTEMS, 2001, 16 (06) : 90 - 92
  • [6] Evaluation of Google question-answering quality
    Zhao, Yiming
    Zhang, Jin
    Xia, Xue
    Le, Taowen
    [J]. LIBRARY HI TECH, 2019, 37 (02) : 312 - 328
  • [7] Question-answering systems as efficient sources of terminological information: an evaluation
    Olvera-Lobo, Maria-Dolores
    Gutierrez-Artacho, Juncal
    [J]. HEALTH INFORMATION AND LIBRARIES JOURNAL, 2010, 27 (04) : 268 - 276
  • [8] Empirical Studies in Question-Answering Systems: A Discussion
    Krueger, Jacob
    Schroeter, Ivonne
    Kenner, Andy
    Leich, Thomas
    [J]. 2017 IEEE/ACM 5TH INTERNATIONAL WORKSHOP ON CONDUCTING EMPIRICAL STUDIES IN INDUSTRY (CESI 2017), 2017, : 23 - 26
  • [9] Knowledge trees and protoforms in question-answering systems
    Yager, RR
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (04) : 550 - 563
  • [10] STORAGE ECONOMY OF INFERENTIAL QUESTION-ANSWERING SYSTEMS
    PEARL, J
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1975, 5 (06) : 595 - 602