Detection of Hateful Social Media Content for Arabic Language

被引:5
|
作者
Al-Ibrahim, Rogayah M. [1 ]
Ali, Mostafa Z. [1 ]
Najadat, Hassan M. [1 ]
机构
[1] Jordan Univ Sci & Technol, POB 3030, Irbid 22110, Jordan
关键词
Hate speech; Arabic language; deep learning; classification; machine learning; Arabic tweets; SPEECH; TWITTER;
D O I
10.1145/3592792
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media is a common medium for expression of views, discussion, sharing of content, and promotion of products and ideas. These views are either polite or obscene. The growth of hate speech is one of the negative aspects of the medium and its emergence poses risk factors for society at various levels. Although there are rules and laws for these platforms, they cannot oversee and control all types of content. Thus, there is an urgent need to develop modern algorithms to automatically detect hateful content on social media. Arab society is not isolated from the world, and the usage of social media by its members has highlighted the importance of automated systems that help build an electronic society free of hate and aggression. This article aims to detect hate speech based on Arabic context over the Twitter platform by proposing different novel deep learning architectures in order to provide a thorough analytical study. Also, a comparative study is presented with a different well-known machine learning algorithm, as well as other state-of-the-art algorithms from the literature to be used as a beacon for interested researchers. These models have been applied to the Arabic tweets dataset, which included 15K tweets and 14 features. After training these models, the results obtained for the top two models included an improved bidirectional long short-term memory with an accuracy of 92.20% and a macro F1-score of 92% and a modified convolutional neural network with an accuracy of 92.10% and a macro F1-score of 91%. The results also showed the superiority of the performance of the deep learning models over other models in terms of accuracy.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Quantum computing and machine learning for Arabic language sentiment classification in social media
    Ahmed Omar
    Tarek Abd El-Hafeez
    Scientific Reports, 13
  • [32] Clinical Trials in Social Media: Content Analysis of Available YouTube Videos in Arabic
    Al-Tabba, Amal
    Al-Omari, Amal
    Al-Hussaini, Maysa
    JOURNAL OF EMPIRICAL RESEARCH ON HUMAN RESEARCH ETHICS, 2020, 15 (03) : NP1 - NP2
  • [33] Geotagging Social Media Content with a Refined Language Modelling Approach
    Kordopatis-Zilos, Giorgos
    Papadopoulos, Symeon
    Kompatsiaris, Yiannis
    INTELLIGENCE AND SECURITY INFORMATICS, PAISI 2015, 2015, 9074 : 21 - 40
  • [34] Language Agnostic Model - Detecting Islamophobic Content on Social Media
    Khan, Heena
    Phillips, Joshua L.
    ACMSE 2021: PROCEEDINGS OF THE 2021 ACM SOUTHEAST CONFERENCE, 2021, : 229 - 233
  • [35] A Survey of Offensive Language Detection for the Arabic Language
    Husain, Fatemah
    Uzuner, Ozlem
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (01)
  • [36] Hate Speech Detection in Social Media for the Kurdish Language
    Saeed, Ari M.
    Ismael, Aso N.
    Rasul, Danya L.
    Majeed, Rayan S.
    Rashid, Tarik A.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INNOVATIONS IN COMPUTING RESEARCH (ICR'22), 2022, 1431 : 253 - 260
  • [37] Sarcasm Detection in Politically Motivated Social Media Content
    Nguyen, Hieu
    Moon, Jihye
    Paul, Nijhum
    Gokhale, Swapna S.
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1538 - 1545
  • [38] Community Detection with Edge Content in Social Media Networks
    Qi, Guo-Jun
    Aggarwal, Charu C.
    Huang, Thomas
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 534 - 545
  • [39] Cancer Treatment Using Herbals in Arabic Social Media: Content Analysis of YouTube Videos
    Abu Daabes, Ajayeb S.
    2018 1ST INTERNATIONAL CONFERENCE ON CANCER CARE INFORMATICS (CCI), 2018, : 215 - 216
  • [40] Enhancing Arabic Dialect Detection on Social Media: A Hybrid Model with an Attention Mechanism
    Yafooz, Wael M. S.
    INFORMATION, 2024, 15 (06)