Language processing and learning models for community question answering in Arabic

被引:16
|
作者
Romeo, Salvatore [1 ]
Da San Martino, Giovanni [1 ]
Belinkov, Yonatan [2 ]
Barron-Cedeno, Alberto [1 ]
Eldesouki, Mohamed [1 ]
Darwish, Kareem [1 ]
Mubarak, Hamdy [1 ]
Glass, James [2 ]
Moschitti, Alessandro [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Community question answering; Constituency parsing in Arabic; Tree-kernel-based ranking; Long short-term memory neural networks; Attention models;
D O I
10.1016/j.ipm.2017.07.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we focus on the problem of question ranking in community question answering (cQA) forums in Arabic. We address the task with machine learning algorithms using advanced Arabic text representations. The latter are obtained by applying tree kernels to constituency parse trees combined with textual similarities, including word embeddings. Our two main contributions are: (i) an Arabic language processing pipeline based on UIMA-from segmentation to constituency parsing-built on top of Farasa, a state-of-the-art Arabic language processing toolkit; and (ii) the application of long short-term memory neural networks to identify the best text fragments in questions to be used in our tree-kernel-based ranker. Our thorough experimentation on a recently released cQA dataset shows that the Arabic linguistic processing provided by Farasa produces strong results and that neural networks combined with tree kernels further boost the performance in terms of both efficiency and accuracy. Our approach also enables an implicit comparison between different processing pipelines as our tests on Farasa and Stanford parsers demonstrate. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:274 / 290
页数:17
相关论文
共 50 条
  • [1] Arabic community question answering
    Nakov, Preslav
    Marquez, Lluis
    Moschitti, Alessandro
    Mubarak, Hamdy
    [J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (01) : 5 - 41
  • [2] Experimenting with a question answering system for the Arabic language
    Hammo, B
    Abuleil, S
    Lytinen, S
    Evens, M
    [J]. COMPUTERS AND THE HUMANITIES, 2004, 38 (04): : 397 - 415
  • [3] Experimenting with a Question Answering System for the Arabic Language
    Bassam Hammo
    Saleem Abuleil
    Steven Lytinen
    Martha Evens
    [J]. Computers and the Humanities, 2004, 38 : 397 - 415
  • [4] Learning English and Arabic question similarity with Siamese Neural Networks in community question answering services
    Othman, Nouha
    Faiz, Rim
    Smaili, Kamel
    [J]. DATA & KNOWLEDGE ENGINEERING, 2022, 138
  • [5] Interactive Language Learning by Question Answering
    Yuan, Xingdi
    Cote, Marc-Alexandre
    Fu, Jie
    Lin, Zhouhan
    Pal, Christopher
    Bengio, Yoshua
    Trischler, Adam
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2796 - 2813
  • [6] Evaluation of an Arabic Chatbot Based on Extractive Question-Answering Transfer Learning and Language Transformers
    Alruqi, Tahani N.
    Alzahrani, Salha M.
    [J]. AI, 2023, 4 (03) : 667 - 691
  • [7] Arabic Biomedical Community Question Answering Based on Contextualized Embeddings
    El Adlouni, Yassine
    Nahnahi, Noureddine En
    El Alaoui, Said Ouatik
    Meknassi, Mohammed
    Rodriguez, Horacio
    Alami, Nabil
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2021, 17 (03) : 13 - 29
  • [8] Deep neural network approach for arabic community question answering
    Almiman, Ali
    Osman, Nada
    Torki, Marwan
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (06) : 4427 - 4434
  • [9] Learning to Rank for Question Routing in Community Question Answering
    Ji, Zongcheng
    Wang, Bin
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2363 - 2368
  • [10] A SUPERVISED LEARNING APPROACH USING THE COMBINATION OF SEMANTIC AND LEXICAL FEATURES FOR ARABIC COMMUNITY QUESTION ANSWERING
    Abdel-Latif, Mahmoud
    Samir, Mohamed
    Abdel-Aziz, Shady
    Heeba, Mohamed
    Elmasry, Ahmed
    Torki, Marwan
    [J]. 2018 IEEE/ACS 15TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2018,