MULTILINGUAL JAILBREAK CHALLENGES IN LARGE LANGUAGE MODELS

Cited by: 0
Authors
Deng, Yue [1, 2]
Zhang, Wenxuan [1, 3]
Pan, Sinno Jialin [2, 4]
Bing, Lidong [1, 3]
Affiliations
[1] DAMO Academy, Alibaba Group, Singapore
[2] Nanyang Technological University, Singapore
[3] Hupan Lab, Hangzhou, 310023, China
[4] The Chinese University of Hong Kong, Hong Kong
Source
arXiv | 2023
Keywords
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Computational linguistics
Related papers
50 in total
  • [1] Jailbreak Attack for Large Language Models: A Survey
    Li, Nan
    Ding, Yidong
    Jiang, Haoyu
    Niu, Jiafei
    Yi, Ping
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (05): 1156 - 1181
  • [2] Tastle: Distract Large Language Models for Automatic Jailbreak Attack
    Xiao, Zeguan
    Yang, Yan
    Chen, Guanhua
    Chen, Yun
    [J]. arXiv,
  • [3] Visual Adversarial Examples Jailbreak Aligned Large Language Models
    Princeton University, United States
    [J]. Proc. AAAI Conf. Artif. Intell., (19): 21527 - 21536
  • [4] HARNESSING TASK OVERLOAD FOR SCALABLE JAILBREAK ATTACKS ON LARGE LANGUAGE MODELS
    Dong, Yiting
    Shen, Guobin
    Zhao, Dongcheng
    He, Xiang
    Zeng, Yi
    [J]. arXiv,
  • [5] JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models
    The State Key Lab of CAD&CG, Zhejiang University, China
    [J]. arXiv,
  • [6] Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models
    Wang, Youze
    Hu, Wenbo
    Dong, Yinpeng
    Liu, Jing
    Zhang, Hanwang
    Hong, Richang
    [J]. IEEE Transactions on Circuits and Systems for Video Technology,
  • [7] Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing
    Zhao, Wei
    Li, Zhe
    Li, Yige
    Zhang, Ye
    Sun, Jun
    [J]. arXiv,
  • [8] Bootstrapping Multilingual Semantic Parsers using Large Language Models
    Awasthi, Abhijeet
    Gupta, Nitish
    Samanta, Bidisha
    Dave, Shachi
    Sarawagi, Sunita
    Talukdar, Partha
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023: 2455 - 2467
  • [9] Multilingual spoken language processing - Challenges for multilingual systems
    Fung, Pascale
    Schultz, Tanja
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (03): 89 - 97
  • [10] Multilingual Code Co-evolution using Large Language Models
    Zhang, Jiyang
    Nie, Pengyu
    Li, Junyi Jessy
    Gligoric, Milos
    [J]. PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023: 695 - 707