Detection of Hateful Social Media Content for Arabic Language

Cited by: 5

Authors:

Al-Ibrahim, Rogayah M. [1]

Ali, Mostafa Z. [1]

Najadat, Hassan M. [1]

Affiliations:

[1] Jordan Univ Sci & Technol, POB 3030, Irbid 22110, Jordan

Source:

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2023, Vol. 22, Issue 09

Keywords:

Hate speech; Arabic language; deep learning; classification; machine learning; Arabic tweets; SPEECH; TWITTER;

DOI:

10.1145/3592792

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Social media is a common medium for expressing views, holding discussions, sharing content, and promoting products and ideas. These views may be polite or obscene. The growth of hate speech is one of the negative aspects of the medium, and its emergence poses risks for society at various levels. Although these platforms are governed by rules and laws, they cannot oversee and control all types of content. Thus, there is an urgent need to develop modern algorithms that automatically detect hateful content on social media. Arab society is not isolated from the world, and its members' use of social media has highlighted the importance of automated systems that help build an electronic society free of hate and aggression. This article aims to detect hate speech in Arabic-language content on the Twitter platform by proposing several novel deep learning architectures and providing a thorough analytical study. A comparative study is also presented against a different well-known machine learning algorithm, as well as other state-of-the-art algorithms from the literature, to serve as a guide for interested researchers. These models were applied to the Arabic tweets dataset, which included 15K tweets and 14 features. After training, the top two models were an improved bidirectional long short-term memory network, with an accuracy of 92.20% and a macro F1-score of 92%, and a modified convolutional neural network, with an accuracy of 92.10% and a macro F1-score of 91%. The results also showed that the deep learning models outperformed the other models in terms of accuracy.


Pages: 26
