Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering

被引:84
|
作者
Esposito, Massimo [1 ]
Damiano, Ernanuele [1 ]
Minutolo, Aniello [1 ]
De Pietro, Giuseppe [1 ]
Fujita, Hamido [2 ]
机构
[1] Natl Res Council Italy, Inst High Performance Comp & Networking ICAR, Naples, Italy
[2] Iwate Prefecture Univ, Takizawa, Iwate, Japan
关键词
Query expansion; Question-answering; Information retrieval; Lexical resources; Word embeddings; Sentence retrieval;
D O I
10.1016/j.ins.2019.12.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question Answering (QA) systems based on Information Retrieval return precise answers to natural language questions, extracting relevant sentences from document collections. However, questions and sentences cannot be aligned terminologically, generating errors in the sentence retrieval. In order to augment the effectiveness in retrieving relevant sentences from documents, this paper proposes a hybrid Query Expansion (QE) approach, based on lexical resources and word embeddings, for QA systems. In detail, synonyms and hypernyms of relevant terms occurring in the question are first extracted from MultiWordNet and, then, contextualized to the document collection used in the QA system. Finally, the resulting set is ranked and filtered on the basis of wording and sense of the question, by employing a semantic similarity metric built on the top of a Word2Vec model. This latter is locally trained on an extended corpus pertaining the same topic of the documents used in the QA system. This QE approach is implemented into an existing QA system and experimentally evaluated, with respect to different possible configurations and selected baselines, for the Italian language and in the Cultural Heritage domain, assessing its effectiveness in retrieving sentences containing proper answers to questions belonging to four different categories. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:88 / 105
页数:18
相关论文
共 50 条
  • [1] Enhancing Question Retrieval in Community Question Answering Using Word Embeddings
    Othman, Nouha
    Faiz, Rim
    Smaili, Kamel
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 485 - 494
  • [2] Query Expansion Using Word Embeddings
    Kuzi, Saar
    Shtok, Anna
    Kurland, Oren
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1929 - 1932
  • [3] A hybrid question answering schema using encapsulated semantics in lexical resources
    Ofoghi, Bahadorreza
    Yearwood, John
    Ghosh, Ranadhir
    [J]. AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 1276 - +
  • [4] Query Expansion With Local Conceptual Word Embeddings in Microblog Retrieval
    Wang, Yashen
    Huang, Heyan
    Feng, Chong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1737 - 1749
  • [5] Query expansion for answer document retrieval in Chinese Question answering system
    Yu, ZT
    Zheng, ZY
    Tang, SP
    Guo, JY
    [J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 72 - 77
  • [6] Query Expansion based on Word Embeddings and Ontologies for Efficient Information Retrieval
    Rastogi, Namrata
    Verma, Parul
    Kumar, Pankaj
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 367 - 373
  • [7] Word embeddings and external resources for answer processing in biomedical factoid question answering
    Dimitriadis, Dimitris
    Tsoumakas, Grigorios
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 92
  • [8] Yahoo! Answers for Sentence Retrieval in Question Answering
    Momtazi, Saeedeh
    Klakow, Dietrich
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : D28 - D35
  • [9] Personalized Query Expansion with Contextual Word Embeddings
    Bassani, Elias
    Tonellotto, Nicola
    Pasi, Gabriella
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)
  • [10] QUALIFIER: Question answering by lexical fabric and external resources
    Yang, H
    Chua, TS
    [J]. EACL 2003: 10TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 363 - 370