A study about the future evaluation of Question-Answering systems

被引:22
|
作者
Rodrigo, Alvaro [1 ]
Penas, Anselmo [1 ]
机构
[1] UNED, NLP & IR Grp, Juan Rosal 16, Madrid, Spain
关键词
Question Answering; Evaluation campaigns; Validation; Textual inference; TESTS;
D O I
10.1016/j.knosys.2017.09.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evaluation campaigns of Question Answering (QA) systems have contributed to the development of such technologies. These campaigns have promoted some changes oriented to overcome results. However, at this period we see how systems have reached an upper bound, as well as systems are still far away from answering complex questions. In this paper, we overview the main QA evaluations over free text, paying special attention to the changes encouraged at such campaigns. We observe that systems still return a high proportion of incorrect answers and that the changes are almost not included in traditional approaches. Moreover, we analyze QA collections in order to obtain better insights about the main challenges for current QA systems. We detect that QA systems find very difficult to deal with different rewordings in questions and documents, as well as to infer information that is not explicitly mentioned in texts. Based on those observations, we recommend a set of directions for future evaluations, suggesting the application of textual inference and knowledge bases as a way for improving results. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:83 / 93
页数:11
相关论文
共 50 条
  • [31] Multilingual Question-Answering System in Biomedical Domain on the Web: An Evaluation
    Olvera-Lobo, Maria-Dolores
    Gutierrez-Artacho, Juncal
    [J]. MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS EVALUATION, 2011, 6941 : 83 - +
  • [32] FRAME-BASED INTERFACE FOR QUESTION-ANSWERING SYSTEMS.
    Takagi, Toshihisa
    Matsuo, Fumihiro
    Ushijima, Kazuo
    [J]. Proceedings - IEEE Computer Society's International Computer Software & Applications Conference, 1985, : 388 - 393
  • [33] FACT RETRIEVAL + DEDUCTIVE QUESTION-ANSWERING INFORMATION RETRIEVAL SYSTEMS
    COOPER, WS
    [J]. JOURNAL OF THE ACM, 1964, 11 (02) : 117 - &
  • [34] A question-answering system using argumentation
    Moreale, E
    Vargas-Vera, M
    [J]. MICAI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 2972 : 400 - 409
  • [35] Conditional Generation with a Question-Answering Blueprint
    Narayan, Shashi
    Maynez, Joshua
    Amplayo, Reinald Kim
    Ganchev, Kuzman
    Louis, Annie
    Huot, Fantine
    Sandholm, Anders
    Das, Dipanjan
    Lapata, Mirella
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 974 - 996
  • [36] A GRAMMAR BASE QUESTION-ANSWERING PROCEDURE
    ROSENBAUM, PS
    [J]. COMMUNICATIONS OF THE ACM, 1967, 10 (10) : 630 - +
  • [37] Disambiguation for Arabic Question-Answering System
    Dardour, Sondes
    Fehri, Hela
    Haddar, Kais
    [J]. FORMALIZING NATURAL LANGUAGES WITH NOOJ 2019 AND ITS NATURAL LANGUAGE PROCESSING APPLICATIONS, NOOJ 2019, 2020, 1153 : 101 - 111
  • [38] MemoriQA: A Question-Answering Lifelog Dataset
    Tran, Quang-Linh
    Nguyen, Binh
    Jones, Gareth J. F.
    Gurrin, Cathal
    [J]. PROCEEDINGS OF THE FIRST ACM WORKSHOP ON AI-POWERED QUESTION ANSWERING SYSTEMS FOR MULTIMEDIA, AIQAM 2024, 2024, : 7 - 12
  • [39] Wedding Dress Question-Answering System
    Liu, Yiwei
    Zhang, Yizhuo
    Wei, Yu-Chih
    Chen, Chi-Hua
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [40] Superficial processing in question-answering activities
    Cerdan, Raquel
    Gilabert, Ramiro
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 266 - 266