A Deep Learning Framework for Automatic Detection of Hate Speech Embedded in Arabic Tweets

被引：0

作者：

Rehab Duwairi

Amena Hayajneh

Muhannad Quwaider

机构：

[1] Jordan University of Science and Technology,

来源：

Arabian Journal for Science and Engineering | 2021年 / 46卷

关键词：

Arabic hate speech; Neural networks; Automatic detection of hateful speech; Deep learning; Text mining; Twitter;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we investigate the ability of CNN, CNN-LSTM, and BiLSTM-CNN deep learning networks to automatically classify or discover hateful content posted on social media. These deep networks were trained and tested using ArHS dataset which consists of 9833 tweets that were annotated to suite hateful speech detection in Arabic. To the best of our knowledge, this is the largest Arabic dataset which handles the subclasses of hate speech. Moreover, we investigate the performance on two existing Arabic hate speech datasets along with ArHS dataset resulting in a combined dataset which consists of 23,678 tweets. Three types of experiment are reported: first, the binary classification of tweets into Hate or Normal, second, ternary classification of tweets into (Hate, Abusive, or Normal), and lastly, multi-class classification of tweets into (Misogyny, Racism, Religious Discrimination, Abusive, and Normal). Using the ArHS dataset, in the binary classification task, the CNN model outperformed other models and achieved an accuracy of 81%. In the ternary classification task, both the CNN and BiLSTM-CNN models achieved the best accuracy of 74%. Lastly, in the multi-class classification task, CNN-LSTM and the BiLSTM-CNN models both achieved the best results with an accuracy of 73%. On the Combined dataset, in the binary classification task, the BiLSTM-CNN achieved an accuracy of 73%. In the ternary classification task, BiLSTM-CNN achieved the best accuracy of 67%. Lastly, in the multi-class classification task, the CNN-LSTM and the BiLSTM-CNN achieved the best accuracy of 65%.

引用

页码：4001 / 4014

页数：13

共 50 条

[31] Intelligent detection of hate speech in Arabic social network: A machine learning approach
Aljarah, Ibrahim
Habib, Maria
Hijazi, Neveen
Faris, Hossam
Qaddoura, Raneem
Hammo, Bassam
Abushariah, Mohammad
Alfawareh, Mohammad
JOURNAL OF INFORMATION SCIENCE, 2021, 47 (04) : 483 - 501
[32] Deep Learning Based Fusion Approach for Hate Speech Detection
Zhou, Yanling
Yang, Yanyan
Liu, Han
Liu, Xiufeng
Savage, Nick
IEEE ACCESS, 2020, 8 : 128923 - 128929
[33] A Framework for Hate Speech Detection Using Deep Convolutional Neural Network
Roy, Pradeep Kumar
Tripathy, Asis Kumar
Das, Tapan Kumar
Gao, Xiao-Zhi
IEEE ACCESS, 2020, 8 : 204951 - 204962
[34] Affect detection from arabic tweets using ensemble and deep learning techniques
AlZoubi, Omar
Tawalbeh, Saja Khaled
AL-Smadi, Mohammad
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 2529 - 2539
[35] Automatic Hate Speech Detection using Machine Learning: A Comparative Study
Abro, Sindhu
Shaikh, Sarang
Ali, Zafar
Khan, Sajid
Mujtaba, Ghulam
Khand, Zahid Hussain
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (08) : 484 - 491
[36] Automatic hate speech detection in audio using machine learning algorithms
Imbwaga J.L.
Chittaragi N.B.
Koolagudi S.G.
International Journal of Speech Technology, 2024, 27 (02) : 447 - 469
[37] Annotation Framework for Hate Speech Identification in Tweets: Case Study of Tweets During Kenyan Elections
Ombui, Edward
Karani, Moses
Muchemi, Lawrence
2019 IST-AFRICA WEEK CONFERENCE (IST-AFRICA), 2019,
[38] Automatic sarcasm detection in Arabic tweets: resources and approaches
Mihi, Soukaina
Benali, Brahim Ait
Laachfoubi, Nabil
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9483 - 9497
[39] Automatic Spam Detection on Gulf Dialectical Arabic Tweets
Alorini, Dema
Rawat, Danda B.
2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 448 - 452
[40] Evaluation of Different Machine Learning and Deep Learning Techniques for Hate Speech Detection
Shawkat, Nabil
Saquer, Jamil
Shatnawi, Hazim
PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024, 2024, : 253 - 258

← 1 2 3 4 5 →