Leveraging Arabic sentiment classification using an enhanced CNN-LSTM approach and effective Arabic text preparation

被引:14
|
作者
Alayba, Abdulaziz M. [1 ]
Palade, Vasile [2 ]
机构
[1] Univ Hail, Coll Comp Sci & Engn, Dept Informat & Comp Sci, Hail 81481, Saudi Arabia
[2] Coventry Univ, Ctr Computat Sci & Math Modelling, Coventry CV1 5FB, W Midlands, England
关键词
Arabic NLP; Arabic sentiment analysis; CNN; LSTM; Word embedding for Arabic; Arabic word normalisation; Deep learning; RECURRENT NEURAL-NETWORK; MACHINE; LEXICON; WORDS; MODEL;
D O I
10.1016/j.jksuci.2021.12.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The high variety in the forms of the Arabic words creates significant complexity related challenges in Natural Language Processing (NLP) tasks for Arabic text. These challenges can be dealt with by using different techniques for semantic representation, such as word embedding methods. In addition, approaches for reducing the diversity in Arabic morphologies can also be employed, for example using appropriate word normalisation for Arabic texts. Deep learning has proven to be very popular in solving different NLP tasks in recent years as well. This paper proposes an approach that combines Convolutional Neural Networks (CNNs) with Long Short-Term Memory (LSTM) networks to improve sentiment classification, by excluding the max-pooling layer from the CNN. This layer reduces the length of generated feature vectors after convolving the filters on the input data. As such, the LSTM networks will receive well-captured vectors from the feature maps. In addition, the paper investigated different effective approaches for preparing and representing the text features in order to increase the accuracy of Arabic sentiment classification.
引用
收藏
页码:9710 / 9722
页数:13
相关论文
共 50 条
  • [1] SEDAT: Sentiment and Emotion Detection in Arabic Text using CNN-LSTM Deep Learning
    Abdullah, Malak
    Hadzikadic, Mirsad
    Shaikh, Samira
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 835 - 840
  • [2] Arabic Sentiment Analysis Using Naive Bayes and CNN-LSTM
    Suleiman, Dima
    Odeh, Aseel
    Al-Sayyed, Rizik
    [J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2022, 46 (06): : 79 - 86
  • [3] A Combined CNN and LSTM Model for Arabic Sentiment Analysis
    Alayba, Abdulaziz M.
    Palade, Vasile
    England, Matthew
    Iqbal, Rahat
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2018, 2018, 11015 : 179 - 191
  • [4] Deep learning CNN-LSTM framework for Arabic sentiment analysis using textual information shared in social networks
    Ombabi, Abubakr H.
    Ouarda, Wael
    Alimi, Adel M.
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2020, 10 (01)
  • [5] An Efficient Hybrid LSTM-CNN and CNN-LSTM with GloVe for Text Multi-class Sentiment Classification in Gender Violence
    Ismail, Abdul Azim
    Yusoff, Marina
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 853 - 863
  • [6] Improving Arabic Sentiment Classification Using a Combined Approach
    Brahimi, Belgacem
    Touahria, Mohamed
    Tari, Abdelkamel
    [J]. COMPUTACION Y SISTEMAS, 2020, 24 (04): : 1403 - 1414
  • [7] Arabic Sentiment Classification: A Hybrid Approach
    Biltawi, Mariam
    Al-Naymat, Ghazi
    Tedmori, Sara
    [J]. 2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 104 - 108
  • [8] Combining Word Order and CNN-LSTM for Sentence Sentiment Classification
    Shuang, Kai
    Ren, Xintao
    Chen, Jian
    Shan, Xiaohan
    Xu, Peng
    [J]. 2017 INTERNATIONAL CONFERENCE ON SOFTWARE AND E-BUSINESS (ICSEB 2017), 2015, : 17 - 21
  • [9] Enhanced Classification of Sentiment Analysis of Arabic Reviews
    Alnemer, Loai
    Alammouri, Bayan
    Alsakran, Jamal
    El Ariss, Omar
    [J]. ADVANCES IN INTERNET, DATA AND WEB TECHNOLOGIES, 2019, 29 : 210 - 220
  • [10] Standard and Dialectal Arabic Text Classification for Sentiment Analysis
    Maghfour, Mohcine
    Elouardighi, Abdeljalil
    [J]. MODEL AND DATA ENGINEERING, MEDI 2018, 2018, 11163 : 282 - 291