A Deep Learning Framework for Automatic Detection of Hate Speech Embedded in Arabic Tweets

Cited by: 0
Authors
Rehab Duwairi
Amena Hayajneh
Muhannad Quwaider
Affiliation
[1] Jordan University of Science and Technology,
Keywords
Arabic hate speech; Neural networks; Automatic detection of hateful speech; Deep learning; Text mining; Twitter;
DOI
Not available
Abstract
In this paper, we investigate the ability of CNN, CNN-LSTM, and BiLSTM-CNN deep learning networks to automatically detect and classify hateful content posted on social media. These networks were trained and tested on the ArHS dataset, which consists of 9,833 tweets annotated for hate speech detection in Arabic. To the best of our knowledge, this is the largest Arabic dataset that covers the subclasses of hate speech. We also evaluate performance on a combined dataset of 23,678 tweets, formed by merging two existing Arabic hate speech datasets with ArHS. Three types of experiments are reported: first, binary classification of tweets into Hate or Normal; second, ternary classification into Hate, Abusive, or Normal; and third, multi-class classification into Misogyny, Racism, Religious Discrimination, Abusive, and Normal. On the ArHS dataset, the CNN model outperformed the other models in the binary task, with an accuracy of 81%. In the ternary task, the CNN and BiLSTM-CNN models both achieved the best accuracy, 74%. In the multi-class task, the CNN-LSTM and BiLSTM-CNN models both achieved the best accuracy, 73%. On the combined dataset, BiLSTM-CNN achieved an accuracy of 73% in the binary task and the best accuracy, 67%, in the ternary task. In the multi-class task, the CNN-LSTM and BiLSTM-CNN models achieved the best accuracy, 65%.
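The three classification tasks in the abstract differ only in label granularity, which can be illustrated with a small label-mapping sketch. This is not the authors' code. The function names are hypothetical, and grouping Misogyny, Racism, and Religious Discrimination under Hate is an assumption inferred from the abstract's reference to "subclasses of hate speech". The abstract does not say how Abusive tweets are treated in the binary task, so the sketch leaves that choice to the caller.

```python
from typing import Optional

# Fine-grained labels named in the abstract's multi-class task.
# ASSUMPTION: these three are the hate speech subclasses.
HATE_SUBCLASSES = {"Misogyny", "Racism", "Religious Discrimination"}
MULTI_CLASS_LABELS = HATE_SUBCLASSES | {"Abusive", "Normal"}


def to_ternary(label: str) -> str:
    """Collapse a multi-class label into Hate / Abusive / Normal."""
    if label not in MULTI_CLASS_LABELS:
        raise ValueError(f"unknown label: {label!r}")
    return "Hate" if label in HATE_SUBCLASSES else label


def to_binary(label: str, abusive_as: Optional[str]) -> Optional[str]:
    """Collapse a multi-class label into Hate / Normal.

    ASSUMPTION: the abstract does not specify how Abusive tweets are
    handled in the binary task, so the caller must choose "Hate",
    "Normal", or None (meaning: drop the tweet from the binary task).
    """
    if abusive_as not in ("Hate", "Normal", None):
        raise ValueError("abusive_as must be 'Hate', 'Normal', or None")
    ternary = to_ternary(label)
    return abusive_as if ternary == "Abusive" else ternary
```

For example, `to_ternary("Racism")` yields `"Hate"`, while `to_binary("Abusive", None)` yields `None`, marking the tweet as excluded under that particular choice.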
Pages: 4001-4014
Page count: 13