ArabBert-LSTM: improving Arabic sentiment analysis based on transformer model and Long Short-Term Memory

被引:0
|
作者
Alosaimi, Wael [1 ]
Saleh, Hager [2 ,3 ,4 ]
Hamzah, Ali A. [5 ]
El-Rashidy, Nora [6 ]
Alharb, Abdullah [1 ]
Elaraby, Ahmed [7 ]
Mostafa, Sherif [2 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Informat Technol, Taif, Saudi Arabia
[2] South Valley Univ, Fac Comp & Artificial Intelligence, Hurghada, Egypt
[3] Galway Univ, Data Sci Inst, Galway, Ireland
[4] Atlantic Technol Univ, Letterkenny, Ireland
[5] Ahram Canadian Univ, 6th Of October City, Egypt
[6] Kafrelsheiksh Univ, Fac Artificial Intelligence, ML & Informat Retrieval Dept, Kafrelsheiksh, Egypt
[7] South Valley Univ, Fac Comp & Informat, Dept Comp Sci, Qena, Egypt
来源
关键词
sentiment analysis; transformer models; deep learning; machine learning; Arabic sentiment analysis; Long Short-Term Memory;
D O I
10.3389/frai.2024.1408845
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis also referred to as opinion mining, plays a significant role in automating the identification of negative, positive, or neutral sentiments expressed in textual data. The proliferation of social networks, review sites, and blogs has rendered these platforms valuable resources for mining opinions. Sentiment analysis finds applications in various domains and languages, including English and Arabic. However, Arabic presents unique challenges due to its complex morphology characterized by inflectional and derivation patterns. To effectively analyze sentiment in Arabic text, sentiment analysis techniques must account for this intricacy. This paper proposes a model designed using the transformer model and deep learning (DL) techniques. The word embedding is represented by Transformer-based Model for Arabic Language Understanding (ArabBert), and then passed to the AraBERT model. The output of AraBERT is subsequently fed into a Long Short-Term Memory (LSTM) model, followed by feedforward neural networks and an output layer. AraBERT is used to capture rich contextual information and LSTM to enhance sequence modeling and retain long-term dependencies within the text data. We compared the proposed model with machine learning (ML) algorithms and DL algorithms, as well as different vectorization techniques: term frequency-inverse document frequency (TF-IDF), ArabBert, Continuous Bag-of-Words (CBOW), and skipGrams using four Arabic benchmark datasets. Through extensive experimentation and evaluation of Arabic sentiment analysis datasets, we showcase the effectiveness of our approach. The results underscore significant improvements in sentiment analysis accuracy, highlighting the potential of leveraging transformer models for Arabic Sentiment Analysis. The outcomes of this research contribute to advancing Arabic sentiment analysis, enabling more accurate and reliable sentiment analysis in Arabic text. The findings reveal that the proposed framework exhibits exceptional performance in sentiment classification, achieving an impressive accuracy rate of over 97%.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] A hybrid convolutional long short-term memory (CNN-LSTM) based natural language processing (NLP) model for sentiment analysis of customer product reviews in Bangla
    Purba, Mahbuba Rahman
    Akter, Moniya
    Ferdows, Rubayea
    Ahmed, Fuad
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2022, 25 (07): : 2111 - 2120
  • [32] An ensemble transformer-based model for Arabic sentiment analysis
    Mohamed, Omar
    Kassem, Aly M. M.
    Ashraf, Ali
    Jamal, Salma
    Mohamed, Ensaf Hussein
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 13 (01)
  • [33] Improving Arabic Sentiment Analysis Using LSTM Based on Word Embedding Models
    Zahidi, Youssra
    Al-Amrani, Yassine
    El Younoussi, Yacine
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (03) : 391 - 407
  • [34] An ensemble transformer-based model for Arabic sentiment analysis
    Omar Mohamed
    Aly M. Kassem
    Ali Ashraf
    Salma Jamal
    Ensaf Hussein Mohamed
    Social Network Analysis and Mining, 13
  • [35] FUEL MOISTURE CONTENT FORECASTING USING LONG SHORT-TERM MEMORY(LSTM) MODEL
    Kang, Zhenyu
    Jiao, Miao
    Zhou, Zijie
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 5672 - 5675
  • [36] Adaptive particle swarm optimization algorithm based long short-term memory networks for sentiment analysis
    Shobana, J.
    Murali, M.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (06) : 10703 - 10719
  • [37] Efficient Long Short-Term Memory-Based Sentiment Analysis of E-Commerce Reviews
    Gondhi, Naveen Kumar
    Chaahat
    Sharma, Eishita
    Alharbi, Amal H.
    Verma, Rohit
    Shah, Mohd Asif
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [38] Transformer Oil Temperature Prediction Based on Long and Short-term Memory Networks
    Sui, Jianxin
    Ling, Xiao
    Xiang, Xing
    Zhang, Genwei
    Zhang, Xiangchi
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 6029 - 6031
  • [39] Text Sentiment Analysis Using the Bald Eagle-Based Bidirectional Long Short-Term Memory
    Kumar, Garadasu Anil
    Reddy, S. Sai Satyanarayana
    Sripallavi, Punna
    Parashuramulu, Bollam
    Banu, B. Suresh
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 26 - 36
  • [40] A Sentiment Analysis Method Based on a Blockchain-Supported Long Short-Term Memory Deep Network
    Mendi, Arif Furkan
    SENSORS, 2022, 22 (12)