MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish

Cited by: 3
Authors
Garrido-Muñoz, Ismael [1]
Martínez-Santiago, Fernando [1]
Montejo-Ráez, Arturo [1]
Affiliations
[1] Univ Jaen, CEATIC, Campus Las Lagunillas, Jaen 23071, Spain
Keywords
Deep learning; Gender bias; Bias evaluation; Language model; BERT; RoBERTa
DOI
10.1007/s10579-023-09670-3
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The study of bias in language models is a growing area of work; however, both research and resources have focused on English. In this paper, we make a first approach focusing on gender bias in freely available Spanish language models trained with popular deep neural architectures, such as BERT or RoBERTa. Some of these models are known for achieving state-of-the-art results on downstream tasks. These promising results have promoted the integration of such models into many real-world applications and production environments, which could be detrimental to the people affected by those systems. This work proposes an evaluation framework to identify gender bias in masked language models, designed with explainability in mind to ease the interpretation of the evaluation results. We have evaluated 20 different models for Spanish, including some of the most popular pretrained ones in the research community. Our findings indicate that varying levels of gender bias are present across these models. The approach compares the adjectives proposed by each model for a set of templates. We classify the given adjectives into understandable categories and compute two new metrics from the model predictions: one based on the internal state (probability) and the other on the external state (rank). These metrics are used to reveal biased models according to the given categories and to quantify the degree of bias of the models under study.
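To make the template-based probing concrete, the following minimal Python sketch queries BETO (one of the models named in the title) through the Hugging Face fill-mask pipeline and records, for each predicted word, both its probability (the internal state) and its rank (the external state). The template sentences and the top_k value are illustrative assumptions, not the authors' exact experimental setup.

# Minimal sketch of template-based adjective probing with a Spanish masked
# language model (BETO). Templates and top_k are illustrative assumptions.
from transformers import pipeline

fill = pipeline("fill-mask", model="dccuchile/bert-base-spanish-wwm-cased")

# Gendered template pair; the mask slot elicits a word for each subject.
templates = {
    "male": "Él es muy {mask}.",
    "female": "Ella es muy {mask}.",
}

for gender, template in templates.items():
    sentence = template.format(mask=fill.tokenizer.mask_token)
    for rank, pred in enumerate(fill(sentence, top_k=10), start=1):
        # pred["score"] is the model's internal probability for the token;
        # rank is its external position among the top predictions.
        print(f"{gender}\t{rank}\t{pred['token_str']}\t{pred['score']:.4f}")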
Pages: 1387-1417
Number of pages: 31
Related papers
50 items in total
  • [21] Evaluating large language models in pediatric nephrology
    Filler, Guido
    Niel, Olivier
    PEDIATRIC NEPHROLOGY, 2025
  • [23] Evaluating large language models on their accuracy and completeness
    Edalat, Camellia
    Kirupaharan, Nila
    Dalvin, Lauren A.
    Mishra, Kapil
    Marshall, Rayna
    Xu, Hannah
    Francis, Jasmine H.
    Berkenstock, Meghan
    RETINA-THE JOURNAL OF RETINAL AND VITREOUS DISEASES, 2025, 45 (01): : 128 - 132
  • [24] Evaluating large language models for software testing
    Li, Yihao
    Liu, Pan
    Wang, Haiyang
    Chu, Jie
    Wong, W. Eric
    COMPUTER STANDARDS & INTERFACES, 2025, 93
  • [25] Evaluating Intelligence and Knowledge in Large Language Models
    Bianchini, Francesco
    TOPOI-AN INTERNATIONAL REVIEW OF PHILOSOPHY, 2025, 44 (01): : 163 - 173
  • [26] Evaluating large language models as agents in the clinic
    Mehandru, Nikita
    Miao, Brenda Y.
    Almaraz, Eduardo Rodriguez
    Sushil, Madhumita
    Butte, Atul J.
    Alaa, Ahmed
    NPJ DIGITAL MEDICINE, 2024, 7 (01)
  • [27] Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance
    Wambsganss, Thiemo
    Su, Xiaotian
    Swamy, Vinitra
    Neshaei, Seyed Parsa
    Rietsche, Roman
    Kaser, Tanja
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10275 - 10288
  • [28] Pipelines for Social Bias Testing of Large Language Models
    Nozza, Debora
    Bianchi, Federico
    Hovy, Dirk
    PROCEEDINGS OF WORKSHOP ON CHALLENGES & PERSPECTIVES IN CREATING LARGE LANGUAGE MODELS (BIGSCIENCE EPISODE #5), 2022, : 68 - 74
  • [29] A Causal View of Entity Bias in (Large) Language Models
    Wang, Fei
    Mo, Wenjie
    Wang, Yiwei
    Zhou, Wenxuan
    Chen, Muhao
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15173 - 15184
  • [30] Do Large Language Models Bias Human Evaluations?
    O'Leary, Daniel E.
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (04) : 83 - 87