The effect of gender bias on hate speech detection

被引:0
|
作者
Furkan Şahinuç
Eyup Halit Yilmaz
Cagri Toraman
Aykut Koç
机构
[1] Aselsan Research Center,Department of Electrical and Electronics Engineering
[2] Bilkent University,The National Magnetic Resonance Research Center
[3] Bilkent University,undefined
来源
关键词
Debiased embedding; Deep learning; Gender identity; Hate speech; Language model;
D O I
暂无
中图分类号
学科分类号
摘要
Hate speech against individuals or communities with different backgrounds is a major problem in online social networks. The domain of hate speech has spread to various topics, including race, religion, and gender. Although there are many efforts for hate speech detection in different domains and languages, the effects of gender identity are not solely examined in hate speech detection. Moreover, hate speech detection is mostly studied for particular languages, specifically English, but not low-resource languages, such as Turkish. We examine gender identity-based hate speech detection for both English and Turkish tweets. We compare the performances of state-of-the-art models using 20 k tweets per language. We observe that transformer-based language models outperform bag-of-words and deep learning models, while the conventional bag-of-words model has surprising performances, possibly due to offensive or hate-related keywords. Furthermore, we analyze the effect of debiased embeddings for hate speech detection. We find that the performance can be improved by removing the gender-related bias in neural embeddings since gender-biased words can have offensive or hateful implications.
引用
收藏
页码:1591 / 1597
页数:6
相关论文
共 50 条
  • [1] The effect of gender bias on hate speech detection
    Sahinuc, Furkan
    Yilmaz, Eyup Halit
    Toraman, Cagri
    Koc, Aykut
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1591 - 1597
  • [2] Bias in Hate Speech and Toxicity Detection
    Lobo, Paula Reyero
    [J]. PROCEEDINGS OF THE 2022 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2022, 2022, : 910 - 910
  • [3] The Risk of Racial Bias in Hate Speech Detection
    Sap, Maarten
    Card, Dallas
    Gabriel, Saadia
    Choi, Yejin
    Smith, Noah A.
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1668 - 1678
  • [4] BERT-Based Logits Ensemble Model for Gender Bias and Hate Speech Detection
    Yun, Sanggeon
    Kang, Seungshik
    Kim, Hyeokman
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (05): : 641 - 651
  • [5] Systematic keyword and bias analyses in hate speech detection
    Sarracen, Gretel Liz De la Pella
    Rosso, Paolo
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (05)
  • [6] Unintended bias evaluation: An analysis of hate speech detection and gender bias mitigation on social media using ensemble learning
    Nascimento, Francimaria R. S.
    Cavalcanti, George D. C.
    Da Costa-Abreu, Marjory
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [7] Racial Bias in Hate Speech and Abusive Language Detection Datasets
    Davidson, Thomas
    Bhattacharya, Debasmita
    Weber, Ingmar
    [J]. THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 25 - 35
  • [8] Polarization and hate speech with gender bias associated with politics: analysis of interactions on Twitter
    Blanco-Alfonso, Ignacio
    Rodriguez-Fernandez, Leticia
    Arce-Garcia, Sergio
    [J]. REVISTA DE COMUNICACION-PERU, 2022, 21 (02): : 33 - 50
  • [9] GENDER AND THE PROHIBITION OF HATE SPEECH
    Weston-Scheuber, Kylie
    [J]. QUT LAW REVIEW, 2012, 12 (02): : 132 - 150
  • [10] Bias Detection and Mitigation in Textual Data: A Study on Fake News and Hate Speech Detection
    Kasampalis, Apostolos
    Chatzakou, Despoina
    Tsikrika, Theodora
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 374 - 383