Machine learning for query formulation in question answering

被引:5
|
作者
Monz, Christof [1 ]
机构
[1] Univ Amsterdam, Inst Informat, NL-1098 XG Amsterdam, Netherlands
关键词
RETRIEVAL;
D O I
10.1017/S1351324910000276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research on question answering dates back to the 1960s but has more recently been revisited as part of TREC's evaluation campaigns, where question answering is addressed as a subarea of information retrieval that focuses on specific answers to a user's information need. Whereas document retrieval systems aim to return the documents that are most relevant to a user's query, question answering systems aim to return actual answers to a users question. Despite this difference, question answering systems rely on information retrieval components to identify documents that contain an answer to a user's question. The computationally more expensive answer extraction methods are then applied only to this subset of documents that are likely to contain an answer. As information retrieval methods are used to filter the documents in the collection, the performance of this component is critical as documents that are not retrieved are not analyzed by the answer extraction component. The formulation of queries that are used for retrieving those documents has a strong impact on the effectiveness of the retrieval component. In this paper, we focus on predicting the importance of terms from the original question. We use model tree machine learning techniques in order to assign weights to query terms according to their usefulness for identifying documents that contain an answer. Term weights are learned by inspecting a large number of query formulation variations and their respective accuracy in identifying documents containing an answer. Several linguistic features are used for building the models, including part-of-speech tags, degree of connectivity in the dependency parse tree of the question, and ontological information. All of these features are extracted automatically by using several natural language processing tools. Incorporating the learned weights into a state-of-the-art retrieval system results in statistically significant improvements in identifying answer-bearing documents.
引用
收藏
页码:425 / 454
页数:30
相关论文
共 50 条
  • [1] Web-based unsupervised learning for query formulation in question answering
    Wang, YC
    Wu, JC
    Liang, T
    Chang, JS
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 519 - 529
  • [2] Model tree learning for query term weighting in question answering
    Monz, Christof
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 589 - 596
  • [3] A Machine Learning Approach for Ranking in Question Answering
    Amato, Alba
    Coronato, Antonio
    ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC-2017), 2018, 13 : 89 - 98
  • [4] A Machine Learning Approach for Factoid Question Answering
    Sal, David Dominguez
    Surdeanu, Mihai
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 131 - 136
  • [5] A reinforcement learning formulation to the complex question answering problem
    Chali, Yllias
    Hasan, Sadid A.
    Mojahid, Mustapha
    INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (03) : 252 - 272
  • [6] A machine learning approach to introspection in a question answering system
    Czuba, K
    Prager, J
    Chu-Carroll, J
    PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 265 - 272
  • [7] Using machine learning and text mining in question answering
    Juarez-Gonzalez, Antonio
    Tellez-Valero, Alberto
    Denicia-Carral, Claudia
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    Evaluation of Multilingual and Multi-modal Information Retrieval, 2007, 4730 : 415 - 423
  • [8] Machine learning for question answering from tabular data
    Khalid, Mahboob Alam
    Jijkoun, Valentin
    de Rijke, Maarten
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 392 - +
  • [9] A machine learning approach for Indonesian question answering system
    Purwarianti, Ayu
    Tsuchiya, Masatoshi
    Nakagawa, Seiichi
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2007, : 537 - +
  • [10] Question Answering System using Machine Learning Techniques
    Dobrescu, Alexandra-Maria
    Radu, Serban
    VISION 2025: EDUCATION EXCELLENCE AND MANAGEMENT OF INNOVATIONS THROUGH SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE, 2019, : 10226 - 10237