The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models

被引:0
|
作者
School of Computer Science and Technology, Xidian University, China [1 ]
机构
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
    Zhou, Weikang
    Wang, Xiao
    Xiong, Limao
    Xia, Han
    Gu, Yingshuang
    Chai, Mingxu
    Zhu, Fukang
    Huang, Caishuang
    Dou, Shihan
    Xi, Zhiheng
    Zheng, Rui
    Gao, Songyang
    Zou, Yicheng
    Yan, Hang
    Le, Yifan
    Wang, Ruohui
    Li, Lijun
    Shao, Jing
    Gui, Tao
    Zhang, Qi
    Huang, Xuanjing
    [J]. arXiv,
  • [2] Jailbreaking Black Box Large Language Models in Twenty Queries
    Chao, Patrick
    Robey, Alexander
    Dobriban, Edgar
    Hassani, Hamed
    Pappas, George J.
    Wong, Eric
    [J]. arXiv, 2023,
  • [3] Unleashing the Unseen: Harnessing Benign Datasets for Jailbreaking Large Language Models
    Zhao, Wei
    Li, Zhe
    Li, Yige
    Sun, Jun
    [J]. arXiv,
  • [4] Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
    Jia, Xiaojun
    Pang, Tianyu
    Du, Chao
    Huang, Yihao
    Gu, Jindong
    Liu, Yang
    Cao, Xiaochun
    Lin, Min
    [J]. arXiv,
  • [5] JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models
    Jin, Haibo
    Hu, Leyang
    Li, Xinnuo
    Zhang, Peiyan
    Chen, Chonghan
    Zhuang, Jun
    Wang, Haohan
    [J]. arXiv,
  • [6] Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking
    Xu, Nan
    Wang, Fei
    Zhou, Ben
    Li, Bangzheng
    Xiao, Chaowei
    Chen, Muhao
    [J]. Findings of the Association for Computational Linguistics: NAACL 2024 - Findings, 2024, : 3526 - 3548
  • [7] Radiology in the era of large language models: the near and the dark side of the moon
    Pilar López-Úbeda
    Teodoro Martín-Noguerol
    Antonio Luna
    [J]. European Radiology, 2023, 33 : 9455 - 9457
  • [8] Radiology in the era of large language models: the near and the dark side of the moon
    Lopez-Ubeda, Pilar
    Martin-Noguerol, Teodoro
    Luna, Antonio
    [J]. EUROPEAN RADIOLOGY, 2023, 33 (12) : 9455 - 9457
  • [9] Open Sesame! Universal Black-Box Jailbreaking of Large Language Models
    Lapid, Raz
    Langberg, Ron
    Sipper, Moshe
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [10] Jailbreaking Pre-trained Large Language Models Towards Hardware Vulnerability Insertion Ability
    Wan, Gwok-Waa
    Wong, Sam-Zaak
    Wang, Xi
    [J]. PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 579 - 582