Unmasking the Mask - Evaluating Social Biases in Masked Language Models

被引:0
|
作者
Kaneko, Masahiro [1 ]
Bollegala, Danushka [2 ,3 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
[2] Univ Liverpool, Liverpool, Merseyside, England
[3] Amazon, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Masked Language Models (MLMs) have shown superior performances in numerous downstream Natural Language Processing (NLP) tasks. Unfortunately, MLMs also demonstrate significantly worrying levels of social biases. We show that the previously proposed evaluation metrics for quantifying the social biases in MLMs are problematic due to the following reasons: (1) prediction accuracy of the masked tokens itself tend to be low in some MLMs, which leads to unreliable evaluation metrics, and (2) in most downstream NLP tasks, masks are not used; therefore prediction of the mask is not directly related to them, and (3) high-frequency words in the training data are masked more often, introducing noise due to this selection bias in the test cases. Therefore, we propose All Unmasked Likelihood (AUL), a bias evaluation measure that predicts all tokens in a test case given the MLM embedding of the unmasked input and AUL with Attention weights (AULA) to evaluate tokens based on their importance in a sentence. Our experimental results show that the proposed bias evaluation measures accurately detect different types of biases in MLMs, and unlike AUL and AULA, previously proposed measures for MLMs systematically overestimate the measured biases and are heavily influenced by the unmasked tokens in the context.
引用
收藏
页码:11954 / 11962
页数:9
相关论文
共 50 条
  • [1] Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models
    Liu, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18707 - 18715
  • [2] CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models
    Nangia, Nikita
    Vania, Clara
    Bhalerao, Rasika
    Bowman, Samuel R.
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1953 - 1967
  • [3] A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
    Zhou, Yi
    Camacho-Collados, Jose
    Bollegala, Danushka
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 11082 - 11100
  • [4] Race, Gender, and Age Biases in Biomedical Masked Language Models
    Kim, Michelle YoungJin
    Kim, Junghwan
    Johnson, Kristen Marie
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11806 - 11815
  • [5] Mask-Predict: Parallel Decoding of Conditional Masked Language Models
    Ghazvininejad, Marjan
    Levy, Omer
    Liu, Yinhan
    Zettlemoyer, Luke
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6112 - 6121
  • [6] Unmasking the Mask Debate on Social Media
    Cerbin, Luca
    DeJesus, Jason
    Warnken, Julia
    Gokhale, Swapna S.
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 677 - 682
  • [7] Generative language models exhibit social identity biases
    Hu, Tiancheng
    Kyrychenko, Yara
    Rathje, Steve
    Collier, Nigel
    van der Linden, Sander
    Roozenbeek, Jon
    NATURE COMPUTATIONAL SCIENCE, 2025, 5 (01): : 65 - 75
  • [8] Towards Understanding and Mitigating Social Biases in Language Models
    Liang, Paul Pu
    Wu, Chiyu
    Morency, Louis-Philippe
    Salakhutdinov, Ruslan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] Unmasking Zorro: functional importance of the facial mask in the Masked Shrike (Lanius nubicus)
    Yosef, Reuven
    Zduniak, Piotr
    Tryjanowski, Piotr
    BEHAVIORAL ECOLOGY, 2012, 23 (03) : 615 - 618
  • [10] Deriving Language Models from Masked Language Models
    Hennigen, Lucas Torroba
    Kim, Yoon
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1149 - 1159