Learning English and Arabic question similarity with Siamese Neural Networks in community question answering services

被引:15
|
作者
Othman, Nouha [1 ,2 ]
Faiz, Rim [3 ]
Smaili, Kamel [1 ]
机构
[1] Univ Lorraine, LORIA, Campus Sci, F-54600 Vandoeuvre Les Nancy, France
[2] Univ Tunis, ISG Tunis, LARODEC, Bardo, Tunisia
[3] Univ Carthage, IHEC Carthage, LARODEC, Carthage Presidency, Tunisia
关键词
Community question answering; Question retrieval; Siamese; LSTM; CNN;
D O I
10.1016/j.datak.2021.101962
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the task of similar question retrieval (QR) which is essential for Community Question Answering (cQA) and aims to retrieve historical questions that are semantically equivalent to the new queries. Over time, with the sharp increase of community archives and the accumulation of duplicated questions, the QR problem has become increasingly challenging due to the shortness of the community questions as well as the word mismatch problem as users can formulate the same query using different wording. Although many efforts have been devoted to address this problem, existing methods mostly relied on supervised models which significantly depend on massive training data sets and manual feature engineering. Such methods are chiefly constrained by their specificities that ignore the word order and do not capture enough syntactic and semantic information in questions. In this paper, we rely on Neural Networks (NNs) which use a deep analysis of words and questions to take into consideration the semantics as well as the structure of questions to predict the semantic text similarity. We propose a deep learning approach based on a Siamese architecture with Long Short-Term Memory (LSTM) networks, augmented with an attention mechanism to let the model give different words different attention while modeling questions. We also explore the use of Convolutional Neural Networks (CNN) nested within the Siamese architecture to retrieve relevant questions. Different similarity measures were tested to predict the semantic similarity between the pairs of questions. To evaluate the proposed approach, we conducted experiments on large-scale datasets in English and Arabic.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Arabic community question answering
    Nakov, Preslav
    Marquez, Lluis
    Moschitti, Alessandro
    Mubarak, Hamdy
    [J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (01) : 5 - 41
  • [2] Neural Arabic Question Answering
    Mozannar, Hussein
    El Hajal, Karl
    Maamary, Elie
    Hajj, Hazem
    [J]. FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), 2019, : 108 - 118
  • [3] Manhattan Siamese LSTM for Question Retrieval in Community Question Answering
    Othman, Nouha
    Faiz, Rim
    Smaili, Kamel
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2019 CONFERENCES, 2019, 11877 : 661 - 677
  • [4] Deep neural network approach for arabic community question answering
    Almiman, Ali
    Osman, Nada
    Torki, Marwan
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (06) : 4427 - 4434
  • [5] Learning semantic representation with neural networks for community question answering retrieval
    Zhou, Guangyou
    Zhou, Yin
    He, Tingting
    Wu, Wensheng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 93 : 75 - 83
  • [6] Language processing and learning models for community question answering in Arabic
    Romeo, Salvatore
    Da San Martino, Giovanni
    Belinkov, Yonatan
    Barron-Cedeno, Alberto
    Eldesouki, Mohamed
    Darwish, Kareem
    Mubarak, Hamdy
    Glass, James
    Moschitti, Alessandro
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (02) : 274 - 290
  • [7] Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering
    Zhou, Xiaoqiang
    Hu, Baotian
    Chen, Qingcai
    Tang, Buzhou
    Wang, Xiaolong
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 713 - 718
  • [8] Learning Question Similarity with Recurrent Neural Networks
    Ye, Borui
    Feng, Guangyu
    Cheriton, David R.
    Cui, Anqi
    Li, Ming
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (IEEE ICBK 2017), 2017, : 111 - 118
  • [9] Learning to Rank for Question Routing in Community Question Answering
    Ji, Zongcheng
    Wang, Bin
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2363 - 2368
  • [10] Question Popularity Analysis and Prediction in Community Question Answering Services
    Liu, Ting
    Zhang, Wei-Nan
    Cao, Liujuan
    Zhang, Yu
    [J]. PLOS ONE, 2014, 9 (05):