EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

被引:0
|
作者
Zhou, Weikang [1 ]
Wang, Xiao [1 ]
Xiong, Limao [1 ]
Xia, Han [1 ]
Gu, Yingshuang [1 ]
Chai, Mingxu [1 ]
Zhu, Fukang [1 ]
Huang, Caishuang [1 ]
Dou, Shihan [1 ]
Xi, Zhiheng [1 ]
Zheng, Rui [1 ]
Gao, Songyang [3 ]
Zou, Yicheng [3 ]
Yan, Hang [3 ]
Le, Yifan [3 ]
Wang, Ruohui [3 ]
Li, Lijun [3 ]
Shao, Jing [3 ]
Gui, Tao [2 ]
Zhang, Qi [1 ]
Huang, Xuanjing [1 ]
机构
[1] School of Computer Science, Fudan University, Shanghai, China
[2] Institute of Modern Languages and Linguistics, Fudan University, Shanghai, China
[3] Shanghai AI Laboratory, China
来源
关键词
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] Jailbreaking Black Box Large Language Models in Twenty Queries
    Chao, Patrick
    Robey, Alexander
    Dobriban, Edgar
    Hassani, Hamed
    Pappas, George J.
    Wong, Eric
    [J]. arXiv, 2023,
  • [2] Unleashing the Unseen: Harnessing Benign Datasets for Jailbreaking Large Language Models
    Zhao, Wei
    Li, Zhe
    Li, Yige
    Sun, Jun
    [J]. arXiv,
  • [3] Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
    Jia, Xiaojun
    Pang, Tianyu
    Du, Chao
    Huang, Yihao
    Gu, Jindong
    Liu, Yang
    Cao, Xiaochun
    Lin, Min
    [J]. arXiv,
  • [4] JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models
    Jin, Haibo
    Hu, Leyang
    Li, Xinnuo
    Zhang, Peiyan
    Chen, Chonghan
    Zhuang, Jun
    Wang, Haohan
    [J]. arXiv,
  • [5] Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking
    Xu, Nan
    Wang, Fei
    Zhou, Ben
    Li, Bangzheng
    Xiao, Chaowei
    Chen, Muhao
    [J]. Findings of the Association for Computational Linguistics: NAACL 2024 - Findings, 2024, : 3526 - 3548
  • [6] The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models
    School of Computer Science and Technology, Xidian University, China
    [J]. arXiv,
  • [7] UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
    Liu, Qi
    He, Yongyi
    Xu, Tong
    Lian, Defu
    Liu, Che
    Zheng, Zhi
    Chen, Enhong
    [J]. International Conference on Information and Knowledge Management, Proceedings, : 1909 - 1919
  • [8] Open Sesame! Universal Black-Box Jailbreaking of Large Language Models
    Lapid, Raz
    Langberg, Ron
    Sipper, Moshe
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [9] UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models
    Li, Xiaoxi
    Zhou, Yujia
    Dou, Zhicheng
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 8688 - 8696
  • [10] Jailbreaking Pre-trained Large Language Models Towards Hardware Vulnerability Insertion Ability
    Wan, Gwok-Waa
    Wong, Sam-Zaak
    Wang, Xi
    [J]. PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 579 - 582