Deep Neural Network Models for Paraphrased Text Classification in the Arabic Language

被引:7
|
作者
Mahmoud, Adnen [1 ,2 ]
Zrigui, Mounir [1 ]
机构
[1] Univ Monastir, Algebra Numbers Theory & Nonlinear Analyzes Lab L, Monastir, Tunisia
[2] Univ Sousse, Higher Inst Comp Sci & Commun Tech, Hammam Sousse, Sousse, Tunisia
关键词
Paraphrase detection; Deep learning; Word embedding; Convolutional neural network; Long short term memory; Arabic corpus construction;
D O I
10.1007/978-3-030-23281-8_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Paraphrase is the act of reusing original texts without proper citation of the source. Different obfuscation operations can be employed such as addition/deletion of words, synonym substitutions, lexical changes, active to passive switching, etc. This phenomenon dramatically increased because of the progressive advancement of the web and the automatic text editing tools. Recently, deep leaning methods have gained competitive results than traditional methods for Natural Language Processing (NLP). In this context, we consider the problem of Arabic paraphrase detection. We present different deep neural networks like Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM). Our aim is to study the effective of each one in extracting the proper features of sentences without the knowledge of semantic and syntactic structure of Arabic language. For the experiments, we propose an automatic corpus construction seeing the lack of Arabic resources publicly available. Evaluations reveal that LSTM model achieved the higher rate of semantic similarity and outperformed significantly other state-of-the-art methods.
引用
收藏
页码:3 / 16
页数:14
相关论文
共 50 条
  • [1] Neural Network for Arabic Text Classification
    Harrag, Fouzi
    El-Qawasmah, Eyas
    [J]. 2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009, : 778 - +
  • [2] Arabic text classification using deep learning models
    Elnagar, Ashraf
    Al-Debsi, Ridhwan
    Einea, Omar
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)
  • [3] Deep Neural Models and Retrofitting for Arabic Text Categorization
    El-Alami, Fatima-Zahra
    El Alaoui, Said Ouatik
    En-Nahnahi, Noureddine
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2020, 16 (02) : 74 - 86
  • [4] Deep Transformer Language Models for Arabic Text Summarization: A Comparison Study
    Chouikhi, Hasna
    Alsuhaibani, Mohammed
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [5] A Deep Belief Network Classification Approach for Automatic Diacritization of Arabic Text
    Almanaseer, Waref
    Alshraideh, Mohammad
    Alkadi, Omar
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [6] Arabic Text Classification Using Convolutional Neural Network and Genetic Algorithms
    Alsaleh, Deem
    Larabi-Marie-Sainte, Souad
    [J]. IEEE ACCESS, 2021, 9 : 91670 - 91685
  • [7] GNoM: Graph Neural Network Enhanced Language Models for Disaster Related Multilingual Text Classification
    Ghosh, Samujjwal
    Maji, Subhadeep
    Desarkar, Maunendra Sankar
    [J]. PROCEEDINGS OF THE 14TH ACM WEB SCIENCE CONFERENCE, WEBSCI 2022, 2022, : 55 - 65
  • [8] Convolutional Deep Belief Network Based Short Text Classification on Arabic Corpus
    Motwakel, Abdelwahed
    Al-Onazi, Badriyya B.
    Alzahrani, Jaber S.
    Marzouk, Radwa
    Aziz, Amira Sayed A.
    Zamani, Abu Sarwar
    Yaseen, Ishfaq
    Abdelmageed, Amgad Atta
    [J]. Computer Systems Science and Engineering, 2023, 45 (03): : 3097 - 3113
  • [9] A Comparative Study of Deep Neural Network Models on Multi-Label Text Classification in Finance
    Maia, Macedo
    Sales, Juliano Efson
    Freitas, Andre
    Handschuh, Siegfried
    Endres, Markus
    [J]. 2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 183 - 190
  • [10] A Deep Learning Approach for Arabic Text Classification
    Sundus, Katrina
    Al-Haj, Fatima
    Hammo, Bassam
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 258 - 264