Detecting Hate Speech in Arabic Tweets During COVID-19 Using Machine Learning Approaches

被引:6
|
作者
Alhejaili, Ruba [1 ]
Alsaeedi, Abdullah [1 ]
Yafooz, Wael M. S. [1 ]
机构
[1] Taibah Univ, Coll Comp Sci & Engn, Dept Comp Sci, Madinah, Saudi Arabia
关键词
Hate speech; Coronavirus classification; Feature extraction; Machine learning;
D O I
10.1007/978-981-19-3148-2_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Content on the Web is increasing day by day, especially on social media, as all users can express their opinions freely and without restrictions. Accordingly, many negative activities have appeared, such as abusive language, racism, and hate speech. Hate speech is one of the negative social media manifestations that require tools to be detected. In this paper, we try to detect hate speech in Arabic tweets published during the COVID-19 pandemic. We compiled a dataset during the pandemic period from January 31 to March 6, 2021. We used a set of machine learning models, namely support vector machine (SVM), random forest (RF), logistic regression (DT), decision tree, AdaBoost, k-nearest neighbors (KNN), and Gaussian naive Bayes (GNB). For feature extraction, we used TF-IDF, where we trained the dataset in three types: unigram, bigram, and trigram. The best results were achieved by LR, RF, and SVM, with an accuracy of 90.8% for LR.
引用
收藏
页码:467 / 475
页数:9
相关论文
共 50 条
  • [1] Detection of hate speech in Arabic tweets using deep learning
    Al-Hassan, Areej
    Al-Dossari, Hmood
    [J]. MULTIMEDIA SYSTEMS, 2022, 28 (06) : 1963 - 1974
  • [2] Detection of hate speech in Arabic tweets using deep learning
    Areej Al-Hassan
    Hmood Al-Dossari
    [J]. Multimedia Systems, 2022, 28 : 1963 - 1974
  • [3] Detecting Arabic Cyberbullying Tweets Using Machine Learning
    Alduailaj, Alanoud Mohammed
    Belghith, Aymen
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01): : 29 - 42
  • [4] Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets
    Oriola, Oluwafemi
    Kotze, Eduan
    [J]. IEEE ACCESS, 2020, 8 : 21496 - 21509
  • [5] COVID-19 Tweets Classification during Lockdown Period Using Machine Learning Classifiers
    Jafar Zaidi, Syed Ali
    Chatterjee, Indranath
    Brahim Belhaouari, Samir
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2022, 2022
  • [6] Detecting Suicidality in Arabic Tweets Using Machine Learning and Deep Learning Techniques
    Abdulsalam, Asma
    Alhothali, Areej
    Al-Ghamdi, Saleh
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (9) : 12729 - 12742
  • [7] Code-mixing unveiled: Enhancing the hate speech detection in Arabic dialect tweets using machine learning models
    Alhazmi, Ali
    Mahmud, Rohana
    Idris, Norisma
    Abo, Mohamed Elhag Mohamed
    Eke, Christopher Ifeanyi
    [J]. PLOS ONE, 2024, 19 (07):
  • [8] Machine learning based approaches for detecting COVID-19 using clinical text data
    Khanday A.M.U.D.
    Rabani S.T.
    Khan Q.R.
    Rouf N.
    Mohi Ud Din M.
    [J]. International Journal of Information Technology, 2020, 12 (3) : 731 - 739
  • [9] A Deep Learning Framework for Automatic Detection of Hate Speech Embedded in Arabic Tweets
    Rehab Duwairi
    Amena Hayajneh
    Muhannad Quwaider
    [J]. Arabian Journal for Science and Engineering, 2021, 46 : 4001 - 4014
  • [10] Arabic Tweets Sentiment Analysis about Online Learning during COVID-19 in Saudi Arabia
    Althagafi, Asma
    Althobaiti, Ghofran
    Alhakami, Hosam
    Alsubait, Tahani
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 620 - 625