Machine learning for query formulation in question answering

被引:5
|
作者
Monz, Christof [1 ]
机构
[1] Univ Amsterdam, Inst Informat, NL-1098 XG Amsterdam, Netherlands
关键词
RETRIEVAL;
D O I
10.1017/S1351324910000276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research on question answering dates back to the 1960s but has more recently been revisited as part of TREC's evaluation campaigns, where question answering is addressed as a subarea of information retrieval that focuses on specific answers to a user's information need. Whereas document retrieval systems aim to return the documents that are most relevant to a user's query, question answering systems aim to return actual answers to a users question. Despite this difference, question answering systems rely on information retrieval components to identify documents that contain an answer to a user's question. The computationally more expensive answer extraction methods are then applied only to this subset of documents that are likely to contain an answer. As information retrieval methods are used to filter the documents in the collection, the performance of this component is critical as documents that are not retrieved are not analyzed by the answer extraction component. The formulation of queries that are used for retrieving those documents has a strong impact on the effectiveness of the retrieval component. In this paper, we focus on predicting the importance of terms from the original question. We use model tree machine learning techniques in order to assign weights to query terms according to their usefulness for identifying documents that contain an answer. Term weights are learned by inspecting a large number of query formulation variations and their respective accuracy in identifying documents containing an answer. Several linguistic features are used for building the models, including part-of-speech tags, degree of connectivity in the dependency parse tree of the question, and ontological information. All of these features are extracted automatically by using several natural language processing tools. Incorporating the learned weights into a state-of-the-art retrieval system results in statistically significant improvements in identifying answer-bearing documents.
引用
收藏
页码:425 / 454
页数:30
相关论文
共 50 条
  • [31] Semantic based Query Expansion for Arabic Question Answering Systems
    Al-Chalabi, Hani
    Ray, Santosh
    Shaalan, Khaled
    2015 FIRST INTERNATIONAL CONFERENCE ON ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2015): ADVANCES IN ARABIC COMPUTATIONAL LINGUISTICS, 2015, : 127 - 132
  • [32] Question Answering over Knowledge Graphs with Query Path Generation
    Yang, Linqing
    Guo, Kecen
    Liu, Bo
    Gong, Jiazheng
    Zhang, Zhujian
    Zhao, Peiyu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 146 - 158
  • [33] Mining Query Subtopics from Questions in Community Question Answering
    Wu, Yu
    Wu, Wei
    Li, Zhoujun
    Zhou, Ming
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 339 - 345
  • [34] Complex Query Augmentation for Question Answering over Knowledge Graphs
    Abdelkawi, Abdelrahman
    Zafar, Hamid
    Maleshkova, Maria
    Lehmann, Jens
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2019 CONFERENCES, 2019, 11877 : 571 - 587
  • [35] Formal Query Generation for Question Answering over Knowledge Bases
    Zafar, Hamid
    Napolitano, Giulio
    Lehmann, Jens
    SEMANTIC WEB (ESWC 2018), 2018, 10843 : 714 - 728
  • [36] Query Context Expansion for Open-Domain Question Answering
    Zhu, Wenhao
    Zhang, Xiaoyu
    Ye, Liang
    Zhai, Qiuhong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (08)
  • [37] IQA: Interactive query construction in semantic question answering systems
    Zafar, Hamid
    Dubey, Mohnish
    Lehmann, Jens
    Demidova, Elena
    JOURNAL OF WEB SEMANTICS, 2020, 64 (64):
  • [38] Study and Development of Question Answering System based on Ontology Query
    Liu, Xiaoqiang
    Guo, Zhenbo
    Wang, Kaixi
    Jiang, Wenxu
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND COMPUTER APPLICATION, 2016, 30 : 430 - 432
  • [39] Investigating Query Expansion and Coreference Resolution in Question Answering on BERT
    Bhattacharjee, Santanu
    Haque, Rejwanul
    Wenniger, Gideon Maillette De Buy
    Way, Andy
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2020), 2020, 12089 : 47 - 59
  • [40] Commonsense Properties from Query Logs and Question Answering Forums
    Romero, Julien
    Razniewski, Simon
    Pal, Koninika
    Pan, Jeff Z.
    Sakhadeo, Archit
    Weikum, Gerhard
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1411 - 1420