Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation

Cited by: 0
Authors
University of Calabria, Italy [1]
Institution
Source
arXiv | 1600
Keywords
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Artificial intelligence
Related papers
50 in total
  • [1] Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
    Cantini, Riccardo
    Cosenza, Giada
    Orsino, Alessio
    Talia, Domenico
    DISCOVERY SCIENCE, DS 2024, PT I, 2025, 15243 : 52 - 68
  • [2] Assessing political bias in large language models
    Rettenberger, Luca
    Reischl, Markus
    Schutera, Mark
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2025, 8 (02)
  • [3] TERMS OF EQUALITY - A GUIDE TO BIAS-FREE LANGUAGE
    PICKENS, JE
    PERSONNEL JOURNAL, 1985, 64 (08) : 24 - &
  • [4] Visual Adversarial Examples Jailbreak Aligned Large Language Models
    Princeton University, United States
    Proc. AAAI Conf. Artif. Intell., (19) : 21527 - 21536
  • [5] Visual Adversarial Examples Jailbreak Aligned Large Language Models
    Qi, Xiangyu
    Huang, Kaixuan
    Panda, Ashwinee
    Henderson, Peter
    Wang, Mengdi
    Mittal, Prateek
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024 : 21527 - 21536
  • [6] Bias-free language in research as a tool to prevent ageism
    Aquino, Marcos Paulo Miranda de
    Cristina, Elisangela
    Hernandes, Ramos
    Alfonsi, Maynara do Amaral
    Perracini, Monica
    BRAZILIAN JOURNAL OF PHYSICAL THERAPY, 2025, 29 (03)
  • [7] Don’t Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models
    Yu, Zhiyuan
    Liu, Xiaogeng
    Liang, Shunning
    Cameron, Zach
    Xiao, Chaowei
    Zhang, Ning
    arXiv
  • [8] Assessing the Risk of Bias in Randomized Clinical Trials With Large Language Models
    Lai, Honghao
    Ge, Long
    Sun, Mingyao
    Pan, Bei
    Huang, Jiajie
    Hou, Liangying
    Yang, Qiuyu
    Liu, Jiayi
    Liu, Jianing
    Ye, Ziying
    Xia, Danni
    Zhao, Weilong
    Wang, Xiaoman
    Liu, Ming
    Talukdar, Jhalok Ronjan
    Tian, Jinhui
    Yang, Kehu
    Estill, Janne
    JAMA NETWORK OPEN, 2024, 7 (05) : E2412687
  • [10] Bias-Free Language: LGBTQ+ Clients and the New APA Manual
    Noble, Nicole
    Bradley, Loretta
    Hendricks, Bret
    JOURNAL OF LGBTQ ISSUES IN COUNSELING, 2021, 15 (01) : 128 - 139