Machine learning for query formulation in question answering

Cited by: 5
Author
Monz, Christof [1]
Affiliation
[1] Univ Amsterdam, Inst Informat, NL-1098 XG Amsterdam, Netherlands
Keywords
RETRIEVAL;
DOI
10.1017/S1351324910000276
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Research on question answering dates back to the 1960s but has more recently been revisited as part of TREC's evaluation campaigns, where question answering is addressed as a subarea of information retrieval that focuses on specific answers to a user's information need. Whereas document retrieval systems aim to return the documents that are most relevant to a user's query, question answering systems aim to return actual answers to a user's question. Despite this difference, question answering systems rely on information retrieval components to identify documents that contain an answer to a user's question. The computationally more expensive answer extraction methods are then applied only to this subset of documents that are likely to contain an answer. As information retrieval methods are used to filter the documents in the collection, the performance of this component is critical as documents that are not retrieved are not analyzed by the answer extraction component. The formulation of queries that are used for retrieving those documents has a strong impact on the effectiveness of the retrieval component. In this paper, we focus on predicting the importance of terms from the original question. We use model tree machine learning techniques in order to assign weights to query terms according to their usefulness for identifying documents that contain an answer. Term weights are learned by inspecting a large number of query formulation variations and their respective accuracy in identifying documents containing an answer. Several linguistic features are used for building the models, including part-of-speech tags, degree of connectivity in the dependency parse tree of the question, and ontological information. All of these features are extracted automatically by using several natural language processing tools. Incorporating the learned weights into a state-of-the-art retrieval system results in statistically significant improvements in identifying answer-bearing documents.
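
The abstract states that term weights are learned by inspecting many query formulation variations and how accurately each one identifies answer-bearing documents. The following is a minimal sketch of how such a per-term training signal could be computed, not the paper's actual procedure. Everything in it is an assumption made for illustration: the toy documents, the overlap-based `retrieve` function, the use of precision at rank k as the accuracy measure, and the definition of a term's gain as the mean accuracy of variations that contain the term minus the mean accuracy of those that do not. The model tree learner and the linguistic features mentioned in the abstract are not shown.

```python
from itertools import combinations
from statistics import mean

# Hypothetical toy collection; "d4" is the only answer-bearing document.
DOCS = {
    "d1": "what a day",
    "d2": "france won the match",
    "d3": "the capital of peru is lima",
    "d4": "paris is the capital of france",
}
ANSWER_DOCS = {"d4"}


def retrieve(query_terms, docs, k=1):
    """Toy retrieval: rank documents by how many query terms they contain."""
    scored = []
    for doc_id, text in docs.items():
        tokens = set(text.lower().split())
        overlap = sum(1 for term in query_terms if term in tokens)
        if overlap:
            scored.append((overlap, doc_id))
    # Higher overlap first; ties are broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:k]]


def query_accuracy(query_terms, docs, answer_docs, k=1):
    """Fraction of the retrieved top-k documents that bear an answer."""
    top = retrieve(query_terms, docs, k)
    if not top:
        return 0.0
    return sum(1 for doc_id in top if doc_id in answer_docs) / len(top)


def term_gains(question_terms, docs, answer_docs, k=1):
    """Score every non-empty subset of the question terms, then compare
    variations that include a term with variations that leave it out."""
    variations = [
        frozenset(subset)
        for size in range(1, len(question_terms) + 1)
        for subset in combinations(question_terms, size)
    ]
    scores = {v: query_accuracy(v, docs, answer_docs, k) for v in variations}
    gains = {}
    for term in question_terms:
        with_term = [s for v, s in scores.items() if term in v]
        without_term = [s for v, s in scores.items() if term not in v]
        baseline = mean(without_term) if without_term else 0.0
        gains[term] = mean(with_term) - baseline
    return gains


if __name__ == "__main__":
    for term, gain in term_gains(["capital", "france", "what"], DOCS, ANSWER_DOCS).items():
        print(f"{term}: {gain:+.3f}")
```

In a setup like the one the abstract describes, values of this kind would serve as regression targets, with features such as part-of-speech tags as inputs. Enumerating all subsets grows exponentially with question length, so this sketch is only practical for short questions.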
Pages: 425-454
Number of pages: 30
Related papers
50 in total
  • [41] Deep Query Ranking for Question Answering over Knowledge Bases
    Zafar, Hamid
    Napolitano, Giulio
    Lehmann, Jens
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 : 635 - 638
  • [42] quEHRy: a question answering system to query electronic health records
    Soni, Sarvesh
    Datta, Surabhi
    Roberts, Kirk
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (06) : 1091 - 1102
  • [43] SPEECH-DRIVEN QUERY RETRIEVAL FOR QUESTION-ANSWERING
    Mishra, Taniya
    Bangalore, Srinivas
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5318 - 5321
  • [44] Query completion in community-based Question Answering search
    Mao, Xian-Ling
    Hao, Yi-Jing
    Wang, Dan
    Huang, Heyan
    NEUROCOMPUTING, 2018, 274 : 3 - 7
  • [45] Interaction history based answer formulation for question answering
    Perera, Rivindu (rivindu.perera@aut.ac.nz), 1600, Springer Verlag (468):
  • [46] Learning to Rank for Question Routing in Community Question Answering
    Ji, Zongcheng
    Wang, Bin
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2363 - 2368
  • [47] Multi-Question Learning for Visual Question Answering
    Lei, Chenyi
    Wu, Lei
    Liu, Dong
    Li, Zhao
    Wang, Guoxin
    Tang, Haihong
    Li, Houqiang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11328 - 11335
  • [48] Knowledge Base Question Answering via Structured Query Generation using Question domain
    Li, Jiecheng
    Peng, Zizhen
    Zhu, Xiaoying
    Lu, Keda
    2022 IEEE 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS, IUCC/CIT/DSCI/SMARTCNS, 2022, : 394 - 400
  • [49] Interaction History Based Answer Formulation for Question Answering
    Perera, Rivindu
    Nand, Parma
    KNOWLEDGE ENGINEERING AND THE SEMANTIC WEB, KESW 2014, 2014, 468 : 128 - 139
  • [50] Neural Learning for Question Answering in Italian
    Croce, Danilo
    Zelenanska, Alexandra
    Basili, Roberto
    AI*IA 2018 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11298 : 389 - 402