Towards safer online communities: Deep learning and explainable AI for hate speech detection and classification

被引：5

作者：

Kibriya, Hareem ^{[1
]}

Siddiqa, Ayesha ^{[1
]}

Khan, Wazir Zada ^{[1
]}

Khan, Muhammad Khurram ^{[2
]}

机构：

[1] Univ Wah, Dept Comp Sci, Wah Cantt 47040, Pakistan

[2] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh 11451, Saudi Arabia

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2024年 / 116卷

关键词：

Hate speech detection; Social media; Deep learning; Explainable Artificial Intelligence; Machine learning; Toxic comments; Hate speech;

D O I：

10.1016/j.compeleceng.2024.109153

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The internet and social media facilitate widespread idea sharing but also contribute to cybercrimes and harmful behaviors, notably the dissemination of abusive and hateful speech, which poses a significant threat to societal cohesion. Hence, prompt and accurate detection of such harmful content is crucial. To address this issue, our study introduces a fully automated end-toend model for hate speech detection and classification using Natural Language Processing and Deep Learning techniques. The proposed architecture comprising embedding, Convolutional, bidirectional Recurrent Neural Network, and bidirectional Long Short Term Memory layers, achieved the highest accuracy of 98.5%. Additionally, we employ explainable AI techniques, such as SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME), to gain insights into the performance of the proposed framework. This comprehensive approach meets the pressing demand for swift and precise detection and categorization of harmful online content.

引用

页数：15

共 50 条

[1] Improving Hate Speech Classification Through Ensemble Learning and Explainable AI Techniques
Garg, Priya
Sharma, M. K.
Kumar, Parteek
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
[2] Hate Speech Detection in Audio Using SHAP - An Explainable AI
Imbwaga, Joan L.
Chittaragi, Nagaratna B.
Koolagudi, Shashidhar G.
ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT II, 2024, 2091 : 289 - 304
[3] Hate Speech is not Free Speech: Explainable Machine Learning for Hate Speech Detection in Code-Mixed Languages
Yadav, Sargam
Kaushik, Abhishek
McDaid, Kevin
2023 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGY AND SOCIETY, ISTAS, 2023,
[4] Deep Learning Ensembles for Hate Speech Detection
Alsafari, Safa
Sadaoui, Samira
Mouhoub, Malek
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 526 - 531
[5] Deep Learning for Hate Speech Detection in Tweets
Badjatiya, Pinkesh
Gupta, Shashank
Gupta, Manish
Varma, Vasudeva
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
[6] Fine-Grained Multilingual Hate Speech Detection Using Explainable AI and Transformers
Siddiqui, Jawaid Ahmed
Yuhaniz, Siti Sophiayati
Shaikh, Ghulam Mujtaba
Soomro, Safdar Ali
Mahar, Zafar Ali
IEEE ACCESS, 2024, 12 : 143177 - 143192
[7] Multilingual Hate Speech Detection: Innovations in Optimized Deep Learning for English and Arabic Hate Speech Detection
Hassan AL-Sukhani
Qusay Bsoul
Abdelrahman H. Elhawary
Ziad M. Nasr
Ahmed E. Mansour
Radwan M. Batyha
Basma S. Alqadi
Jehad Saad Alqurni
Hayat Alfagham
Magda M. Madbouly
SN Computer Science, 6 (3)
[8] Explainable hate speech detection using LIME
Joan L. Imbwaga
Nagaratna B. Chittaragi
Shashidhar G. Koolagudi
International Journal of Speech Technology, 2024, 27 (3) : 793 - 815
[9] Glaucoma Detection Using Explainable AI and Deep Learning
Afreen N.
Aluvalu R.
EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
[10] Deep Explainable Hate Speech Active Learning on Social-Media Data
Ahmed, Usman
Lin, Jerry Chun-Wei
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 4625 - 4635

← 1 2 3 4 5 →