Unmasking the Mask - Evaluating Social Biases in Masked Language Models

被引:0
|
作者
Kaneko, Masahiro [1 ]
Bollegala, Danushka [2 ,3 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
[2] Univ Liverpool, Liverpool, Merseyside, England
[3] Amazon, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Masked Language Models (MLMs) have shown superior performances in numerous downstream Natural Language Processing (NLP) tasks. Unfortunately, MLMs also demonstrate significantly worrying levels of social biases. We show that the previously proposed evaluation metrics for quantifying the social biases in MLMs are problematic due to the following reasons: (1) prediction accuracy of the masked tokens itself tend to be low in some MLMs, which leads to unreliable evaluation metrics, and (2) in most downstream NLP tasks, masks are not used; therefore prediction of the mask is not directly related to them, and (3) high-frequency words in the training data are masked more often, introducing noise due to this selection bias in the test cases. Therefore, we propose All Unmasked Likelihood (AUL), a bias evaluation measure that predicts all tokens in a test case given the MLM embedding of the unmasked input and AUL with Attention weights (AULA) to evaluate tokens based on their importance in a sentence. Our experimental results show that the proposed bias evaluation measures accurately detect different types of biases in MLMs, and unlike AUL and AULA, previously proposed measures for MLMs systematically overestimate the measured biases and are heavily influenced by the unmasked tokens in the context.
引用
收藏
页码:11954 / 11962
页数:9
相关论文
共 50 条
  • [21] The Diminishing Returns of Masked Language Models to Science
    Hong, Zhi
    Ajith, Aswathy
    Pauloski, J. Gregory
    Duede, Eamon
    Chard, Kyle
    Foster, Ian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1270 - 1283
  • [22] On Masked Language Models for Contextual Link Prediction
    Brayne, Angus
    Wiatrak, Maciej
    Corneil, Dane
    PROCEEDINGS OF DEEP LEARNING INSIDE OUT (DEELIO 2022): THE 3RD WORKSHOP ON KNOWLEDGE EXTRACTION AND INTEGRATION FOR DEEP LEARNING ARCHITECTURES, 2022, : 87 - 99
  • [23] Unsupervised Subtitle Segmentation with Masked Language Models
    Ponce, David
    Etchegoyhen, Thierry
    Ruiz, Victor
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 771 - 781
  • [24] DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
    He, Zhengfu
    Sun, Tianxiang
    Tang, Qiong
    Wang, Kuanning
    Huang, Xuanjing
    Qiu, Xipeng
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4521 - 4534
  • [25] Speciesist language and nonhuman animal bias in English Masked Language Models
    Takeshita, Masashi
    Rzepka, Rafal
    Araki, Kenji
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (05)
  • [26] Isotropy-Enhanced Conditional Masked Language Models
    Guo, Pei
    Xiao, Yisheng
    Li, Juntao
    Ji, Yixin
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8278 - 8289
  • [27] Unmasking Societal Biases in Respiratory Support for ICU Patients through Social Determinants of Health
    Moukheiber, Mira
    Moukheiber, Lama
    Moukheiber, Dana
    Lee, Hyung-Chul
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 7421 - 7429
  • [28] Gender Bias in Masked Language Models for Multiple Languages
    Kaneko, Masahiro
    Imankulova, Aizhan
    Bollegala, Danushka
    Okazaki, Naoaki
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2740 - 2750
  • [29] Multilingual Normalization of Temporal Expressions with Masked Language Models
    Lange, Lukas
    Stroetgen, Jannik
    Adel, Heike
    Klakow, Dietrich
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1174 - 1186
  • [30] FairGauge: A Modularized Evaluation of Bias in Masked Language Models
    Doughman, Jad
    Shehata, Shady
    Karray, Fakhri
    PROCEEDINGS OF THE 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2023, 2023, : 131 - 135