Towards safer online communities: Deep learning and explainable AI for hate speech detection and classification

被引:5
|
作者
Kibriya, Hareem [1 ]
Siddiqa, Ayesha [1 ]
Khan, Wazir Zada [1 ]
Khan, Muhammad Khurram [2 ]
机构
[1] Univ Wah, Dept Comp Sci, Wah Cantt 47040, Pakistan
[2] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh 11451, Saudi Arabia
关键词
Hate speech detection; Social media; Deep learning; Explainable Artificial Intelligence; Machine learning; Toxic comments; Hate speech;
D O I
10.1016/j.compeleceng.2024.109153
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The internet and social media facilitate widespread idea sharing but also contribute to cybercrimes and harmful behaviors, notably the dissemination of abusive and hateful speech, which poses a significant threat to societal cohesion. Hence, prompt and accurate detection of such harmful content is crucial. To address this issue, our study introduces a fully automated end-toend model for hate speech detection and classification using Natural Language Processing and Deep Learning techniques. The proposed architecture comprising embedding, Convolutional, bidirectional Recurrent Neural Network, and bidirectional Long Short Term Memory layers, achieved the highest accuracy of 98.5%. Additionally, we employ explainable AI techniques, such as SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME), to gain insights into the performance of the proposed framework. This comprehensive approach meets the pressing demand for swift and precise detection and categorization of harmful online content.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
    Mathew, Binny
    Saha, Punyajoy
    Yimam, Seid Muhie
    Biemann, Chris
    Goyal, Pawan
    Mukherjee, Animesh
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14867 - 14875
  • [22] Improving Sinhala Hate Speech Detection Using Deep Learning
    Gamage, Kavishka
    Welgama, Viraj
    Weerasinghe, Ruvan
    2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [23] Decoding Fake News and Hate Speech: A Survey of Explainable AI Techniques
    Ngueajio, Mikel k.
    Aryal, Saurav
    Atemkeng, Marcellin
    Washington, Gloria
    Rawat, Danda
    ACM COMPUTING SURVEYS, 2025, 57 (07)
  • [24] Detection of hate speech in Arabic tweets using deep learning
    Al-Hassan, Areej
    Al-Dossari, Hmood
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 1963 - 1974
  • [25] Detection of hate speech in Arabic tweets using deep learning
    Areej Al-Hassan
    Hmood Al-Dossari
    Multimedia Systems, 2022, 28 : 1963 - 1974
  • [26] Deep Learning Based Fusion Approach for Hate Speech Detection
    Zhou, Yanling
    Yang, Yanyan
    Liu, Han
    Liu, Xiufeng
    Savage, Nick
    IEEE ACCESS, 2020, 8 : 128923 - 128929
  • [27] Towards Hate Speech Detection at Large via Deep Generative Modeling
    Wullach, Tomer
    Adler, Amir
    Minkov, Einat
    IEEE INTERNET COMPUTING, 2021, 25 (02) : 48 - 57
  • [28] Deep Learning for Hate Speech Intensity Analysis: DistilBERT Classification Algorithm
    Riyadi, Slamet
    Masyhur, Ahmad Musthafa
    Andriyani, Annisa Divayu
    2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
  • [29] Performance Comparison of Machine Learning and Deep Learning Algorithms in Detecting Online Hate Speech
    Shibly, F. H. A.
    Sharma, Uzzal
    Naleer, H. M. M.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 695 - 706
  • [30] Evaluation of Different Machine Learning and Deep Learning Techniques for Hate Speech Detection
    Shawkat, Nabil
    Saquer, Jamil
    Shatnawi, Hazim
    PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024, 2024, : 253 - 258