The effect of gender bias on hate speech detection

被引:0
|
作者
Sahinuc, Furkan [1 ]
Yilmaz, Eyup Halit [1 ]
Toraman, Cagri [1 ]
Koc, Aykut [2 ,3 ]
机构
[1] Aselsan Res Ctr, TR-06200 Ankara, Turkey
[2] Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey
[3] Bilkent Univ, Natl Magnet Resonance Res Ctr, TR-06800 Ankara, Turkey
关键词
Debiased embedding; Deep learning; Gender identity; Hate speech; Language model;
D O I
10.1007/s11760-022-02368-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hate speech against individuals or communities with different backgrounds is a major problem in online social networks. The domain of hate speech has spread to various topics, including race, religion, and gender. Although there are many efforts for hate speech detection in different domains and languages, the effects of gender identity are not solely examined in hate speech detection. Moreover, hate speech detection is mostly studied for particular languages, specifically English, but not low-resource languages, such as Turkish. We examine gender identity-based hate speech detection for both English and Turkish tweets. We compare the performances of state-of-the-art models using 20 k tweets per language. We observe that transformer-based language models outperform bag-of-words and deep learning models, while the conventional bag-of-words model has surprising performances, possibly due to offensive or hate-related keywords. Furthermore, we analyze the effect of debiased embeddings for hate speech detection. We find that the performance can be improved by removing the gender-related bias in neural embeddings since gender-biased words can have offensive or hateful implications.
引用
收藏
页码:1591 / 1597
页数:7
相关论文
共 50 条
  • [31] Levantine hate speech detection in twitter
    Medyan AbdelHamid
    Assef Jafar
    Yasser Rahal
    [J]. Social Network Analysis and Mining, 2022, 12
  • [32] Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model
    Saleh, Hind
    Alhothali, Areej
    Moria, Kawthar
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [33] A Federated Approach for Hate Speech Detection
    Gala, Jay
    Gandhi, Deep
    Mehta, Jash
    Talat, Zeerak
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3248 - 3259
  • [34] Levantine hate speech detection in twitter
    AbdelHamid, Medyan
    Jafar, Assef
    Rahal, Yasser
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [35] Hate Speech Detection in Roman Urdu
    Khan, Muhammad Moin
    Shahzad, Khurram
    Malik, Muhammad Kamran
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (01)
  • [36] Prosecutorial perspectives on gender-bias hate crimes
    McPhail, BA
    DiNitto, DM
    [J]. VIOLENCE AGAINST WOMEN, 2005, 11 (09) : 1162 - 1185
  • [37] Hate Speech is not Free Speech: Explainable Machine Learning for Hate Speech Detection in Code-Mixed Languages
    Yadav, Sargam
    Kaushik, Abhishek
    McDaid, Kevin
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGY AND SOCIETY, ISTAS, 2023,
  • [38] A valid question: Could hate speech condition bias in the brain?
    Murrow, Gail B.
    Murrow, Richard
    [J]. JOURNAL OF LAW AND THE BIOSCIENCES, 2016, 3 (01): : 196 - 201
  • [40] Is hate speech detection the solution the world wants?
    Parker, Sara
    Ruths, Derek
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (10)