Comparative analysis of Deep Learning and Machine Learning algorithms for emoji prediction from Arabic text

被引:1
|
作者
Mokhamed, Takua [1 ]
Harous, Saad [1 ]
Hussein, Nada [2 ]
Ismail, Heba [2 ]
机构
[1] Univ Sharjah, Coll Comp & Informat, Dept Comp Sci, Sharjah, U Arab Emirates
[2] Abu Dhabi Univ, Coll Engn, Abu Dhabi, U Arab Emirates
关键词
Emoji prediction; Recommendation; Arabic sentence; Natural Language Processing; Machine Learning; Deep Learning;
D O I
10.1007/s13278-024-01217-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emojis have become a crucial part of text-based communication in recent years, especially on social media and messaging services. As a result, emoji prediction has gained increasing attention as a research topic in Natural Language Processing. Emoji recommendation is a task of predicting relevant emojis based on the emotional and contextual orientation of the text. In this study, we provide a comparative analysis of several Machine Learning (ML) and Deep Learning (DL) methods for emoji prediction from Arabic text. ML models are commonly used as baselines for emoji prediction; hence, more sophisticated DL models are needed for performance enhancement. In this work, we evaluate the performance of three baseline ML models, namely Support Vector Machines (SVM), Multinomial Naive Bayes (MNB), and Random Forest (RF), as well as state-of-art DL models, namely Long Short-Term Memory (LSTM), Bidirectional Long Short-Term Memory (BiLSTM), Arabic Bidirectional Encoder Representations from Transformers (AraBERT), and Multilingual Bidirectional Encoder Representations from Transformers (mBERT). This research is evaluated utilizing a large corpus of Twitter dataset that is translated to Arabic and balanced to enhance the prediction performance. Throughout the experiments, the ML models achieved classification accuracies of 74%, 78.9%, and 84% for SVM, MNB, and RF, respectively. Furthermore, the DL models achieved accuracies of 91.16%, 91%, 85%, and 80% for LSTM, BiLSTM, AraBERT, and mBERT, respectively.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Comparative analysis of Deep Learning and Machine Learning algorithms for emoji prediction from Arabic text
    Takua Mokhamed
    Saad Harous
    Nada Hussein
    Heba Ismail
    [J]. Social Network Analysis and Mining, 14
  • [2] Machine Learning Algorithms for Attitude Prediction from Arabic Text: Detecting Student Attitude towards Online Learning
    Alshdaifat, Esraa
    Al-Shdaifat, Ala’A
    Alsarhan, Ayoub
    [J]. International Journal of Interactive Mobile Technologies, 2024, 18 (12) : 42 - 56
  • [3] Machine learning algorithms in Arabic Text Classification: A Review
    Aboalnaser, Sara A.
    [J]. 12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019, : 290 - 295
  • [4] Analyzing Machine Learning Algorithms for Sentiments in Arabic Text
    Yafoz, Ayman
    Mouhoub, Malek
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2150 - 2156
  • [5] Comparative Analysis of Machine Learning Algorithms for Rainfall Prediction
    Patil, Rudragoud
    Bedekar, Gayatri
    [J]. INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 833 - 842
  • [6] CHURN PREDICTION - A COMPARATIVE ANALYSIS WITH SUPERVISED MACHINE LEARNING ALGORITHMS
    Gangadharan, Chika K.
    Alex, Roshni
    Sabu, M. K.
    [J]. ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2021, 20 (12): : 3049 - 3060
  • [7] Comparative Analysis of Machine Learning Algorithms to Urban Traffic Prediction
    Lee, Yong-Ju
    Min, Okgee
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1034 - 1036
  • [8] Machine Learning Algorithms for Transportation Mode Prediction: A Comparative Analysis
    Murrar S.
    Alhaj F.
    Qutqut M.H.
    [J]. Informatica (Slovenia), 2024, 48 (06): : 117 - 130
  • [9] Comparative Analysis of Machine Learning Algorithms for CKD Risk Prediction
    Yang, Weilin
    Ahmed, Nasim
    Barczak, Andre L. C.
    [J]. IEEE Access, 2024, 12 : 171205 - 171220
  • [10] A COMPARATIVE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR IPO UNDERPERFORMANCE PREDICTION
    Sonsare, Pravinkumar M.
    Pande, Ashtavinayak
    Kumar, Sudhanshu
    Kurve, Akshay
    Shanbhag, Chinmay
    [J]. JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2023, 5 (06):