Enhancing Question Retrieval in Community Question Answering Using Word Embeddings

被引:15
|
作者
Othman, Nouha [1 ]
Faiz, Rim [2 ]
Smaili, Kamel [3 ]
机构
[1] Univ Tunis, ISG Tunis, LARODEC, Bardo, Tunisia
[2] Univ Carthage, IHEC Carthage, LARODEC, Carthage Presidency, Tunisia
[3] Univ Lorraine, LORIA, Campus Sci, F-54600 Villers Les Nancy, France
关键词
Community Question Answering; Question retrieval; Word embeddings;
D O I
10.1016/j.procs.2019.09.203
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community Question Answering (CQA) services have evolved into a popular way of online information seeking, where users can interact and exchange knowledge in the form of questions and answers. In this paper, we study the problem of finding historical questions that are semantically equivalent to the queried ones, assuming that the answers to the similar questions should also answer the new ones. The major challenge of question retrieval is the word mismatch problem between questions, as users can formulate the same question using different wording. Most existing methods measure the similarity between questions based on the bag of-words (BOWS) representation capturing no semantics between words. Therefore, this study proposes to use word embeddings, which can capture semantic and syntactic information from contexts, to vectorize the questions. The questions are clustered using Kmeans to speed up the search and ranking tasks. The similarity between the questions is measured using cosine similarity based on their weighted continuous valued vectors. We run our experiments on real world data set from Yahoo! Answers in English and Arabic to show the efficiency and generality of our proposed method. (C) 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/) Peer-review under responsibility of KES International.
引用
收藏
页码:485 / 494
页数:10
相关论文
共 50 条
  • [31] Enhancing yes/no question answering with weak supervision via extractive question answering
    Dimitriadis, Dimitris
    Tsoumakas, Grigorios
    [J]. APPLIED INTELLIGENCE, 2023, 53 (22) : 27560 - 27570
  • [32] Alignment over Heterogeneous Embeddings for Question Answering
    Yadav, Vikas
    Bethard, Steven
    Surdeanu, Mihai
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2681 - 2691
  • [33] Document retrieval in the context of question answering
    Monz, C
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 571 - 579
  • [34] Generative retrieval for conversational question answering
    Li, Yongqi
    Yang, Nan
    Wang, Liang
    Wei, Furu
    Li, Wenjie
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (05)
  • [35] Multilingue Passage Retrieval for Question Answering
    Gomez, Jose M.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (40): : 149 - 150
  • [36] Arabic community question answering
    Nakov, Preslav
    Marquez, Lluis
    Moschitti, Alessandro
    Mubarak, Hamdy
    [J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (01) : 5 - 41
  • [37] Learning to Rank for Question Routing in Community Question Answering
    Ji, Zongcheng
    Wang, Bin
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2363 - 2368
  • [38] Formulation of a hybrid expertise retrieval system in community question answering services
    Dipankar Kundu
    Deba Prasad Mandal
    [J]. Applied Intelligence, 2019, 49 : 463 - 477
  • [39] Question Answering over Knowledge Base using Language Model Embeddings
    Sai Sharath, Japa
    Banafsheh, Rekabdar
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [40] Formulation of a hybrid expertise retrieval system in community question answering services
    Kundu, Dipankar
    Mandal, Deba Prasad
    [J]. APPLIED INTELLIGENCE, 2019, 49 (02) : 463 - 477