Detection of Hateful Social Media Content for Arabic Language

被引:5
|
作者
Al-Ibrahim, Rogayah M. [1 ]
Ali, Mostafa Z. [1 ]
Najadat, Hassan M. [1 ]
机构
[1] Jordan Univ Sci & Technol, POB 3030, Irbid 22110, Jordan
关键词
Hate speech; Arabic language; deep learning; classification; machine learning; Arabic tweets; SPEECH; TWITTER;
D O I
10.1145/3592792
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media is a common medium for expression of views, discussion, sharing of content, and promotion of products and ideas. These views are either polite or obscene. The growth of hate speech is one of the negative aspects of the medium and its emergence poses risk factors for society at various levels. Although there are rules and laws for these platforms, they cannot oversee and control all types of content. Thus, there is an urgent need to develop modern algorithms to automatically detect hateful content on social media. Arab society is not isolated from the world, and the usage of social media by its members has highlighted the importance of automated systems that help build an electronic society free of hate and aggression. This article aims to detect hate speech based on Arabic context over the Twitter platform by proposing different novel deep learning architectures in order to provide a thorough analytical study. Also, a comparative study is presented with a different well-known machine learning algorithm, as well as other state-of-the-art algorithms from the literature to be used as a beacon for interested researchers. These models have been applied to the Arabic tweets dataset, which included 15K tweets and 14 features. After training these models, the results obtained for the top two models included an improved bidirectional long short-term memory with an accuracy of 92.20% and a macro F1-score of 92% and a modified convolutional neural network with an accuracy of 92.10% and a macro F1-score of 91%. The results also showed the superiority of the performance of the deep learning models over other models in terms of accuracy.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Arabic Social Media Analysis and Translation
    Mallek, Fatma
    Belainine, Billal
    Sadat, Fatiha
    ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 298 - 303
  • [42] Sentiment Lexicons for Arabic Social Media
    Mohammad, Saif M.
    Salameh, Mohammad
    Kiritchenko, Svetlana
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 33 - 37
  • [43] Preprocessing Arabic text on social media
    Hegazi, Mohamed Osman
    Al-Dossari, Yasser
    Al-Yahy, Abdullah
    Al-Sumari, Abdulaziz
    Hilal, Anwer
    HELIYON, 2021, 7 (02)
  • [44] Behavior analysis in Arabic social media
    Abutiheen Z.A.
    Mohammed E.A.
    Hussein M.H.
    International Journal of Speech Technology, 2022, 25 (03) : 659 - 666
  • [45] The problem of varying annotations to identify abusive language in social media content
    Seemann, Nina
    Lee, Yeong Su
    Hoellig, Julian
    Geierhos, Michaela
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1561 - 1585
  • [46] Pretrained Transformer Language Models Versus Pretrained Word Embeddings for the Detection of Accurate Health Information on Arabic Social Media: Comparative Study
    Albalawi, Yahya
    Nikolov, Nikola S.
    Buckley, Jim
    JMIR FORMATIVE RESEARCH, 2022, 6 (06)
  • [47] "HOT" ChatGPT: ThePromiseofChatGPTinDetectingand Discriminating Hateful, Offensive, and Toxic Comments on Social Media
    Li, Lingyao
    Fan, Lizhou
    Atreja, Shubham
    Hemphill, Libby
    ACM TRANSACTIONS ON THE WEB, 2024, 18 (02)
  • [48] Non-Native Arabic Learners' Social Media Usage and Motivation Influencing Learning of Arabic Language in Malaysian Public Universities
    Xuan, Di
    Ismail, Wail Muin
    Zailani, Muhammad Azhar
    IJOLE-INTERNATIONAL JOURNAL OF LANGUAGE EDUCATION, 2020, 4 (02): : 258 - 275
  • [49] Offensive Language Detection on Social Media using Machine Learning
    Abdrakhmanov, Rustam
    Kenesbayev, Serik Muktarovich
    Berkimbayev, Kamalbek
    Toikenov, Gumyrbek
    Abdrashova, Elmira
    Alchinbayeva, Oichagul
    Ydyrys, Aizhan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 575 - 582
  • [50] Offensive Language Detection on Social Media Based on Text Classification
    Hajibabaee, Parisa
    Malekzadeh, Masoud
    Ahmadi, Mohsen
    Heidari, Maryam
    Esmaeilzadeh, Armin
    Abdolazimi, Reyhaneh
    Jones, James H., Jr.
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 92 - 98