Neutralizing Gender Bias in Word Embeddings with Latent Disentanglement and Counterfactual Generation

被引:0
|
作者
Shin, Seungjae [1 ]
Song, Kyungwoo [1 ]
Jang, JoonHo [1 ]
Kim, Hyemi [1 ]
Joo, Weonyoung [1 ]
Moon, Il-Chul [1 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research demonstrates that word embeddings, trained on the human-generated corpus, have strong gender biases in embedding spaces, and these biases can result in the discriminative results from the various downstream tasks. Whereas the previous methods project word embeddings into a linear subspace for debiasing, we introduce a Latent Disentanglement method with a siamese auto-encoder structure with an adapted gradient reversal layer. Our structure enables the separation of the semantic latent information and gender latent information of given word into the disjoint latent dimensions. Afterwards, we introduce a Counterfactual Generation to convert the gender information of words, so the original and the modified embeddings can produce a gender-neutralized word embedding after geometric alignment regularization, without loss of semantic information. From the various quantitative and qualitative debiasing experiments, our method shows to be better than existing debiasing methods in debiasing word embeddings. In addition, Our method shows the ability to preserve semantic information during debiasing by minimizing the semantic information losses for extrinsic NLP downstream tasks.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [11] Gender Bias in Word Embeddings: A Comprehensive Analysis of Frequency, Syntax, and Semantics
    Caliskan, Aylin
    Ajay, Pimparkar Parth
    Charlesworth, Tessa
    Wolfe, Robert
    Banaji, Mahzarin R.
    PROCEEDINGS OF THE 2022 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2022, 2022, : 156 - 170
  • [12] Equalizing Gender Bias in Neural Machine Translation with Word Embeddings Techniques
    Font, Joel Escude
    Costa-jussa, Marta R.
    GENDER BIAS IN NATURAL LANGUAGE PROCESSING (GEBNLP 2019), 2019, : 147 - 154
  • [13] The effects of gender bias in word embeddings on patient phenotyping in the mental health domain
    Sogancioglu, Gizem
    Kaya, Heysem
    Salah, Albert Ali
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, ACII, 2023,
  • [14] Robustness and Reliability of Gender Bias Assessment in Word Embeddings: The Role of Base Pairs
    Zhang, Haiyang
    Sneyd, Alison
    Stevenson, Mark
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 759 - 769
  • [15] Gender Bias Hidden Behind Chinese Word Embeddings: The Case of Chinese Adjectives
    Jiao, Meichun
    Luo, Ziyang
    GEBNLP 2021: THE 3RD WORKSHOP ON GENDER BIAS IN NATURAL LANGUAGE PROCESSING, 2021, : 8 - 15
  • [16] Understanding the Origins of Bias in Word Embeddings
    Brunet, Marc-Etienne
    Alkalay-Houlihan, Colleen
    Anderson, Ashton
    Zemel, Richard
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [17] Word Embeddings via Causal Inference: Gender Bias Reducing and Semantic Information Preserving
    Ding, Lei
    Yu, Dengdeng
    Xie, Jinhan
    Guo, Wenxing
    Hu, Shenggang
    Liu, Meichen
    Kong, Linglong
    Dai, Hongsheng
    Bao, Yanchun
    Jiang, Bei
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11864 - 11872
  • [18] Using Word Embeddings to Examine Gender Bias in Dutch Newspapers, 1950-1990
    Wevers, Melvin
    1ST INTERNATIONAL WORKSHOP ON COMPUTATIONAL APPROACHES TO HISTORICAL LANGUAGE CHANGE, 2019, : 92 - 97
  • [19] LEWIS: Latent Embeddings for Word Images and their Semantics
    Gordo, Albert
    Almazan, Jon
    Murray, Naila
    Perronnin, Florent
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1242 - 1250
  • [20] Decoupled Word Embeddings using Latent Topics
    Park, Heesoo
    Lee, Jongwuk
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 875 - 882