A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets

被引:3
|
作者
Almalki, Jameel [1 ]
机构
[1] Umm Al Qura Univ, Coll Comp Al Leith, Dept Comp Sci, Mecca, Saudi Arabia
关键词
Sentiment analysis; Social media; -Learning; Twitter; Apache Spark; Arabic language; SOCIAL MEDIA;
D O I
10.7717/peerj-cs.1047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media platforms such as Twitter, YouTube, Instagram and Facebook are leading sources of large datasets nowadays. Twitter's data is one of the most reliable due to its privacy policy. Tweets have been used for sentiment analysis and to identify meaningful information within the dataset. Our study focused on the distance learning domain in Saudi Arabia by analyzing Arabic tweets about distance learning. This work proposes a model for analyzing people's feedback using a Twitter dataset in the distance learning domain. The proposed model is based on the Apache Spark product to manage the large dataset. The proposed model uses the Twitter API to get the tweets as raw data. These tweets were stored in the Apache Spark server. A regex-based technique for preprocessing removed retweets, links, hashtags, English words and numbers, usernames, and emojis from the dataset. After that, a Logistic-based Regression model was trained on the pre-processed data. This Logistic Regression model, from the field of machine learning, was used to predict the sentiment inside the tweets. Finally, a Flask application was built for sentiment analysis of the Arabic tweets. The proposed model gives better results when compared to various applied techniques. The proposed model is evaluated on test data to calculate Accuracy, F1 Score, Precision, and Recall, obtaining scores of 91%, 90%, 90%, and 89%, respectively.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets
    Almalki, Jameel
    [J]. PeerJ Computer Science, 2022, 8
  • [2] Evaluating sentiment analysis for Arabic Tweets using machine learning and deep learning
    Alshutayri, Areej
    Alamoudi, Huda
    Alshehri, Boushra
    Aldhahri, Eman
    Alsaleh, Iqbal
    Aljojo, Nahla
    Alghoson, Abdullah
    [J]. ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2022, 32 (04): : 7 - 18
  • [3] Topic features for machine learning-based sentiment analysis in Indonesian tweets
    Murfi, Hendri
    Siagian, Furida Lusi
    Satria, Yudi
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2019, 12 (01) : 70 - 81
  • [4] Quantum Particle Swarm Optimization with Deep Learning-Based Arabic Tweets Sentiment Analysis
    Al-onazi, Badriyya B.
    Hassan, Abdulkhaleq Q. A.
    Nour, Mohamed K.
    Al Duhayyim, Mesfer
    Mohamed, Abdullah
    Abdelmageed, Amgad Atta
    Yaseen, Ishfaq
    Mohammed, Gouse Pasha
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 2575 - 2591
  • [5] Sentiment Analysis of Tweets using Machine Learning Approach
    Rathi, Megha
    Malik, Aditya
    Varshney, Daksh
    Sharma, Rachita
    Mendiratta, Sarthak
    [J]. 2018 ELEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2018, : 365 - 367
  • [6] A Machine Learning-Based Lexicon Approach for Sentiment Analysis
    Sahu, Tirath Prasad
    Khandekar, Sarang
    [J]. INTERNATIONAL JOURNAL OF TECHNOLOGY AND HUMAN INTERACTION, 2020, 16 (02) : 8 - 22
  • [7] HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
    Elshakankery, Kariman
    Ahmed, Mona F.
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2019, 20 (03) : 163 - 171
  • [8] Sentiment analysis of tweets through Altmetrics: A machine learning approach
    Hassan, Saeed-Ul
    Saleem, Aneela
    Soroya, Saira Hanif
    Safder, Iqra
    Iqbal, Sehrish
    Jamil, Saqib
    Bukhari, Faisal
    Aljohani, Naif Radi
    Nawaz, Raheel
    [J]. JOURNAL OF INFORMATION SCIENCE, 2021, 47 (06) : 712 - 726
  • [9] Sentiment Analysis of Arabic Tweets using Deep Learning
    Heikal, Maha
    Torki, Marwan
    El-Makky, Nagwa
    [J]. ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 114 - 122
  • [10] Sentiment Analysis of Arabic Tweets about Violence Against Women using Machine Learning
    Alzyout, Moath
    Al Bashabsheh, Emran
    Najadat, Hassan
    Alaiad, Ahmad
    [J]. 2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 171 - 176