Machine learning for query formulation in question answering

Cited by: 5
Author
Monz, Christof [1]
Affiliation
[1] Univ Amsterdam, Inst Informat, NL-1098 XG Amsterdam, Netherlands
Keywords
RETRIEVAL;
DOI
10.1017/S1351324910000276
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Research on question answering dates back to the 1960s but has more recently been revisited as part of TREC's evaluation campaigns, where question answering is addressed as a subarea of information retrieval that focuses on specific answers to a user's information need. Whereas document retrieval systems aim to return the documents that are most relevant to a user's query, question answering systems aim to return actual answers to a user's question. Despite this difference, question answering systems rely on information retrieval components to identify documents that contain an answer to a user's question. The computationally more expensive answer extraction methods are then applied only to this subset of documents that are likely to contain an answer. Because information retrieval methods are used to filter the documents in the collection, the performance of this component is critical: documents that are not retrieved are never analyzed by the answer extraction component. The formulation of the queries used to retrieve those documents has a strong impact on the effectiveness of the retrieval component. In this paper, we focus on predicting the importance of terms from the original question. We use model tree machine learning techniques to assign weights to query terms according to their usefulness for identifying documents that contain an answer. Term weights are learned by inspecting a large number of query formulation variations and their respective accuracy in identifying documents containing an answer. Several linguistic features are used for building the models, including part-of-speech tags, degree of connectivity in the dependency parse tree of the question, and ontological information. All of these features are extracted automatically by using several natural language processing tools. Incorporating the learned weights into a state-of-the-art retrieval system results in statistically significant improvements in identifying answer-bearing documents.
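The abstract does not say how the accuracy of query variations is turned into per-term training targets. As a rough illustration only, the sketch below assumes one simple possibility: enumerate every non-empty subset of the question terms, score each subset by the precision of a Boolean-AND match against a toy collection, and take a term's gain to be the mean score of variants that include it minus the mean score of variants that omit it. The names `DOCS`, `answer_precision`, and `term_gains`, the toy documents, and the scoring rule are all hypothetical and are not taken from the paper. The model tree step that would predict such weights from linguistic features is omitted.

```python
from itertools import combinations
from statistics import mean

# Hypothetical toy collection: (terms in the document, does it contain the answer?)
DOCS = [
    ({"eiffel", "tower", "built", "1889"}, True),
    ({"eiffel", "tower", "paris"}, False),
    ({"tower", "london", "built"}, False),
    ({"eiffel", "built", "1889", "gustave"}, True),
]


def answer_precision(query_terms, docs=DOCS):
    """Fraction of Boolean-AND matches that contain an answer (0.0 if no match)."""
    hits = [has_answer for doc_terms, has_answer in docs if query_terms <= doc_terms]
    return sum(hits) / len(hits) if hits else 0.0


def term_gains(terms, score=answer_precision):
    """Assumed target: mean score with the term minus mean score without it."""
    unique_terms = list(dict.fromkeys(terms))  # drop duplicates, keep order
    # Every non-empty subset of the question terms is one query variation.
    variants = [
        frozenset(combo)
        for size in range(1, len(unique_terms) + 1)
        for combo in combinations(unique_terms, size)
    ]
    scores = {variant: score(variant) for variant in variants}

    gains = {}
    for term in unique_terms:
        with_term = [s for v, s in scores.items() if term in v]
        without_term = [s for v, s in scores.items() if term not in v]
        # A single-term question has no variant without the term.
        baseline = mean(without_term) if without_term else 0.0
        gains[term] = mean(with_term) - baseline
    return gains


gains = term_gains(["eiffel", "tower", "built"])
for term, gain in sorted(gains.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{term}: {gain:+.3f}")
```

In this toy collection, "tower" receives a lower gain than "eiffel" or "built" because it matches more documents that lack the answer. Exhaustive enumeration grows exponentially with the number of terms, so it is only practical for short questions.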
Pages: 425-454
Page count: 30