Towards safer online communities: Deep learning and explainable AI for hate speech detection and classification

被引:5
|
作者
Kibriya, Hareem [1 ]
Siddiqa, Ayesha [1 ]
Khan, Wazir Zada [1 ]
Khan, Muhammad Khurram [2 ]
机构
[1] Univ Wah, Dept Comp Sci, Wah Cantt 47040, Pakistan
[2] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh 11451, Saudi Arabia
关键词
Hate speech detection; Social media; Deep learning; Explainable Artificial Intelligence; Machine learning; Toxic comments; Hate speech;
D O I
10.1016/j.compeleceng.2024.109153
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The internet and social media facilitate widespread idea sharing but also contribute to cybercrimes and harmful behaviors, notably the dissemination of abusive and hateful speech, which poses a significant threat to societal cohesion. Hence, prompt and accurate detection of such harmful content is crucial. To address this issue, our study introduces a fully automated end-toend model for hate speech detection and classification using Natural Language Processing and Deep Learning techniques. The proposed architecture comprising embedding, Convolutional, bidirectional Recurrent Neural Network, and bidirectional Long Short Term Memory layers, achieved the highest accuracy of 98.5%. Additionally, we employ explainable AI techniques, such as SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME), to gain insights into the performance of the proposed framework. This comprehensive approach meets the pressing demand for swift and precise detection and categorization of harmful online content.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Improving Hate Speech Classification Through Ensemble Learning and Explainable AI Techniques
    Garg, Priya
    Sharma, M. K.
    Kumar, Parteek
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [2] Hate Speech Detection in Audio Using SHAP - An Explainable AI
    Imbwaga, Joan L.
    Chittaragi, Nagaratna B.
    Koolagudi, Shashidhar G.
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT II, 2024, 2091 : 289 - 304
  • [3] Hate Speech is not Free Speech: Explainable Machine Learning for Hate Speech Detection in Code-Mixed Languages
    Yadav, Sargam
    Kaushik, Abhishek
    McDaid, Kevin
    2023 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGY AND SOCIETY, ISTAS, 2023,
  • [4] Deep Learning Ensembles for Hate Speech Detection
    Alsafari, Safa
    Sadaoui, Samira
    Mouhoub, Malek
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 526 - 531
  • [5] Deep Learning for Hate Speech Detection in Tweets
    Badjatiya, Pinkesh
    Gupta, Shashank
    Gupta, Manish
    Varma, Vasudeva
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
  • [6] Fine-Grained Multilingual Hate Speech Detection Using Explainable AI and Transformers
    Siddiqui, Jawaid Ahmed
    Yuhaniz, Siti Sophiayati
    Shaikh, Ghulam Mujtaba
    Soomro, Safdar Ali
    Mahar, Zafar Ali
    IEEE ACCESS, 2024, 12 : 143177 - 143192
  • [7] Multilingual Hate Speech Detection: Innovations in Optimized Deep Learning for English and Arabic Hate Speech Detection
    Hassan AL-Sukhani
    Qusay Bsoul
    Abdelrahman H. Elhawary
    Ziad M. Nasr
    Ahmed E. Mansour
    Radwan M. Batyha
    Basma S. Alqadi
    Jehad Saad Alqurni
    Hayat Alfagham
    Magda M. Madbouly
    SN Computer Science, 6 (3)
  • [8] Explainable hate speech detection using LIME
    Joan L. Imbwaga
    Nagaratna B. Chittaragi
    Shashidhar G. Koolagudi
    International Journal of Speech Technology, 2024, 27 (3) : 793 - 815
  • [9] Glaucoma Detection Using Explainable AI and Deep Learning
    Afreen N.
    Aluvalu R.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [10] Deep Explainable Hate Speech Active Learning on Social-Media Data
    Ahmed, Usman
    Lin, Jerry Chun-Wei
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 4625 - 4635