Multilingual Hate Speech Detection Using Semi-supervised Generative Adversarial Network

被引:1
|
作者
Mnassri, Khouloud [1 ]
Farahbakhsh, Reza [1 ]
Crespi, Noel [1 ]
机构
[1] Inst Polytech Paris, Samovar, Telecom SudParis, F-91120 Palaiseau, France
关键词
Hate Speech; offensive language; semi-supervised; GAN; mBERT; multilingual; social media;
D O I
10.1007/978-3-031-53503-1_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online communication has overcome linguistic and cultural barriers, enabling global connection through social media platforms. However, linguistic variety introduced more challenges in tasks such as the detection of hate speech content. Although multiple NLP solutions were proposed using advanced machine learning techniques, data annotation scarcity is still a serious problem urging the need for employing semi-supervised approaches. This paper proposes an innovative solution-a multilingual Semi-Supervised model based on Generative Adversarial Networks (GAN) and mBERT models, namely SS-GAN-mBERT. We managed to detect hate speech in Indo-European languages (in English, German, and Hindi) using only 20% labeled data from the HASOC2019 dataset. Our approach excelled in multilingual, zero-shot cross-lingual, and monolingual paradigms, achieving, on average, a 9.23% F1 score boost and 5.75% accuracy increase over baseline mBERT model.
引用
收藏
页码:192 / 204
页数:13
相关论文
共 50 条
  • [1] Multilingual Hate Speech Detection: A Semi-Supervised Generative Adversarial Approach
    Mnassri, Khouloud
    Farahbakhsh, Reza
    Crespi, Noel
    [J]. ENTROPY, 2024, 26 (04)
  • [2] DISCRIMINATIVE SEMI-SUPERVISED GENERATIVE ADVERSARIAL NETWORK FOR HYPERSPECTRAL ANOMALY DETECTION
    Jiang, Tao
    Xie, Weiying
    Li, Yunsong
    Du, Qian
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2420 - 2423
  • [3] Abnormal Transactions Detection in the Ethereum Network Using Semi-Supervised Generative Adversarial Networks
    Sanjalawe, Yousef K.
    Al-E'mari, Salam R.
    [J]. IEEE ACCESS, 2023, 11 : 98516 - 98531
  • [4] Semi-Supervised MIMO Detection Using Cycle-Consistent Generative Adversarial Network
    Zhu, Hongzhi
    Guo, Yongliang
    Xu, Wei
    You, Xiaohu
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2023, 9 (05) : 1226 - 1240
  • [5] Healthy-unhealthy animal detection using semi-supervised generative adversarial network
    Almal, Shubh
    Bagepalli, Apoorva Reddy
    Dutta, Prajjwal
    Chaki, Jyotismita
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [6] Healthy-unhealthy animal detection using semi-supervised generative adversarial network
    Almal, Shubh
    Bagepalli, Apoorva Reddy
    Dutta, Prajjwal
    Chaki, Jyotismita
    [J]. PeerJ Computer Science, 2023, 9
  • [7] Generative adversarial network for semi-supervised image captioning
    Liang, Xu
    Li, Chen
    Tian, Lihua
    [J]. Computer Vision and Image Understanding, 2024, 249
  • [8] Semi-supervised semantic segmentation using an improved generative adversarial network
    Xu, Di
    Wang, Zhili
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 9709 - 9719
  • [9] Poster Abstract: A Semi-Supervised Approach for Network Intrusion Detection Using Generative Adversarial Networks
    Jeong, Hyejeong
    Yu, Jieun
    Lee, Wonjun
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [10] Semi-supervised Learning Using Generative Adversarial Networks
    Chang, Chuan-Yu
    Chen, Tzu-Yang
    Chung, Pau-Choo
    [J]. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 892 - 896