Data Augmentation for Improving Explainability of Hate Speech Detection

被引:0
|
作者
Ansari, Gunjan [1 ]
Kaur, Parmeet [2 ]
Saxena, Chandni [3 ]
机构
[1] JSS Acad Tech Educ, Dept Informat Technol, Noida, India
[2] Jaypee Inst Informat Technol, Dept Comp Sci & Informat Technol, Noida, India
[3] Chinese Univ Hong Kong, SAR, Hong Kong, Peoples R China
关键词
Hate speech; Cyberbullying; Explainable AI; Data augmentation; LIME; Integrated gradient;
D O I
10.1007/s13369-023-08100-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The paper presents a novel data augmentation-based approach to develop explainable, deep learning models for hate speech detection. Hate speech is widely prevalent on online social media but difficult to detect automatically due to challenges of natural language processing and complexity of hate speech. Further, the decisions of the existing solutions possess constrained explainability since limited annotated data are available for training and testing of models. Therefore, this work proposes the use of text-based data augmentation for improving the performance and explainability of deep learning models. Techniques based on easy data augmentation, bidirectional encoder representations from transformers and back translation have been utilized for data augmentation. Convolutional neural networks and long short-term memory models are trained with augmented data and evaluated on two publicly available datasets for hate speech detection. Methods of LIME and integrated gradients are used to retrieve explanations of the deep learning models. A diagnostic study is conducted on test samples to check for improvement in the models as a result of the data augmentation. The experimental results verify that the proposed approach improves the explainability as well as the accuracy of hate speech detection.
引用
收藏
页码:3609 / 3621
页数:13
相关论文
共 50 条
  • [1] Data Augmentation for Improving Explainability of Hate Speech Detection
    Gunjan Ansari
    Parmeet Kaur
    Chandni Saxena
    Arabian Journal for Science and Engineering, 2024, 49 : 3609 - 3621
  • [2] Exploring Data Augmentation Strategies for Hate Speech Detection in Roman Urdu
    Azam, Ubaid
    Rizwan, Hammad
    Karim, Asim
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4523 - 4531
  • [3] Application of Data Augmentation Techniques for Hate Speech Detection with Deep Learning
    Venturott, Ligia Iunes
    Ciarelli, Patrick Marques
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021), 2021, 12981 : 778 - 787
  • [4] Data-Efficient Methods For Improving Hate Speech Detection
    Roychowdhury, Sumegh
    Gupta, Vikram
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 125 - 132
  • [5] Token replacement-based data augmentation methods for hate speech detection
    Kosisochukwu Judith Madukwe
    Xiaoying Gao
    Bing Xue
    World Wide Web, 2022, 25 : 1129 - 1150
  • [6] Token replacement-based data augmentation methods for hate speech detection
    Madukwe, Kosisochukwu Judith
    Gao, Xiaoying
    Xue, Bing
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (03): : 1129 - 1150
  • [7] An approach of data augmentation to improve the performance of BERTology models for vietnamese hate speech detection
    Luu, Son T.
    Van Nguyen, Kiet
    Nguyen, Ngan Luu-Thuy
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 56763 - 56783
  • [8] Improving Hate Speech Detection with Deep Learning Ensembles
    Zimmerman, Steven
    Fox, Chris
    Kruschwitz, Udo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2546 - 2553
  • [9] Improving Counterfactual Generation for Fair Hate Speech Detection
    Davani, Aida Mostafazadeh
    Omrani, Ar
    Kennedy, Brendan
    Atari, Mohammad
    Ren, Xiang
    Dehghani, Morteza
    WOAH 2021: THE 5TH WORKSHOP ON ONLINE ABUSE AND HARMS, 2021, : 92 - 101
  • [10] Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
    Oneata, Dan
    Cucu, Horia
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4578 - 4587