Pipelines for Social Bias Testing of Large Language Models

被引:0
|
作者
Nozza, Debora [1 ]
Bianchi, Federico [1 ]
Hovy, Dirk [1 ]
机构
[1] Bocconi Univ, Via Sarfatti 25, Milan, Italy
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The maturity level of language models is now at a stage in which many companies rely on them to solve various tasks. However, while research has shown how biased and harmful these models are, systematic ways of integrating social bias tests into development pipelines are still lacking. This short paper suggests how to use these verification techniques in development pipelines. We take inspiration from software testing and suggest addressing social bias evaluation as software testing. We hope to open a discussion on the best methodologies to handle social bias testing in language models.
引用
收藏
页码:68 / 74
页数:7
相关论文
共 50 条
  • [1] Bias and Fairness in Large Language Models: A Survey
    Gallegos, Isabel O.
    Rossi, Ryan A.
    Barrow, Joe
    Tanjim, Md Mehrab
    Kim, Sungchul
    Dernoncourt, Franck
    Yu, Tong
    Zhang, Ruiyi
    Ahmed, Nesreen K.
    [J]. COMPUTATIONAL LINGUISTICS, 2024, 50 (03) : 1097 - 1179
  • [2] Gender bias and stereotypes in Large Language Models
    Kotek, Hadas
    Dockum, Rikker
    Sun, David Q.
    [J]. PROCEEDINGS OF THE ACM COLLECTIVE INTELLIGENCE CONFERENCE, CI 2023, 2023, : 12 - 24
  • [3] Do Large Language Models Bias Human Evaluations?
    O'Leary, Daniel E.
    [J]. IEEE INTELLIGENT SYSTEMS, 2024, 39 (04) : 83 - 87
  • [4] Cultural bias and cultural alignment of large language models
    Tao, Yan
    Viberg, Olga
    Baker, Ryan S.
    Kizilcec, Rene F.
    [J]. PNAS NEXUS, 2024, 3 (09):
  • [5] Persistent Anti-Muslim Bias in Large Language Models
    Abid, Abubakar
    Farooqi, Maheen
    Zou, James
    [J]. AIES '21: PROCEEDINGS OF THE 2021 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2021, : 298 - 306
  • [6] Social Value Alignment in Large Language Models
    Abbol, Giulio Antonio
    Marchesi, Serena
    Wykowska, Agnieszka
    Belpaeme, Tony
    [J]. VALUE ENGINEERING IN ARTIFICIAL INTELLIGENCE, VALE 2023, 2024, 14520 : 83 - 97
  • [7] Leveraging Cognitive Science for Testing Large Language Models
    Srinivasan, Ramya
    Inakoshi, Hiroya
    Uchino, Kanji
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 169 - 171
  • [8] Testing theory of mind in large language models and humans
    Strachan, James W. A.
    Albergo, Dalila
    Borghini, Giulia
    Pansardi, Oriana
    Scaliti, Eugenio
    Gupta, Saurabh
    Saxena, Krati
    Rufo, Alessandro
    Panzeri, Stefano
    Manzi, Guido
    Graziano, Michael S. A.
    Becchio, Cristina
    [J]. NATURE HUMAN BEHAVIOUR, 2024, 8 (07): : 1285 - 1295
  • [9] A Survey of Testing Techniques Based on Large Language Models
    Qi, Fei
    Hou, Yingnan
    Lin, Ning
    Bao, Shanshan
    Xu, Nuo
    [J]. PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 280 - 284
  • [10] Implicit bias in large language models: Experimental proof and implications for education
    Warr, Melissa
    Oster, Nicole Jakubczyk
    Isaac, Roger
    [J]. JOURNAL OF RESEARCH ON TECHNOLOGY IN EDUCATION, 2024,