AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks

Authors
Zeng, Yifan [1]
Wu, Yiran [2]
Zhang, Xiao [3]
Wang, Huazheng [1]
Wu, Qingyun [2]
Affiliations
[1] Oregon State University, United States
[2] Pennsylvania State University, United States
[3] CISPA Helmholtz Center for Information Security, Germany
Source
arXiv
Keywords
Agent systems - Filtering mechanism - Language model - Large models - Model agents - Multi agent - Open-source - Performance - Pre-training
DOI
Not available
Related papers
50 in total
  • [41] A survey on LLM-based multi-agent systems: workflow, infrastructure, and challenges
    Xinyi Li
    Sai Wang
    Siqi Zeng
    Yu Wu
    Yi Yang
    Vicinagearth, 1 (1):
  • [42] Adaptive Partitioning for Coordinated Multi-agent Perimeter Defense
    Macharet, Douglas G.
    Chen, Austin K.
    Shishika, Daigo
    Pappas, George J.
    Kumar, Vijay
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 7971 - 7977
  • [43] Adaptation of multi-agent systems for power infrastructure defense
    Liu, CC
    2001 IEEE POWER ENGINEERING SOCIETY WINTER MEETING, CONFERENCE PROCEEDINGS, VOLS 1-3, 2001, : 154 - 154
  • [44] Immune multi-agent network intrusion Defense model
    Hu, Qiang
    Qiu, JianChuan
    Song, Gang
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2007, : 205 - +
  • [45] Defense and homeland security applications of multi-agent simulations
    Lucas, Thomas W.
    Sanchez, Susan M.
    Martinez, Felix
    Sickinger, Lisa R.
    Roginski, Jonathan W.
    PROCEEDINGS OF THE 2007 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2007, : 126 - +
  • [46] MACS: Multi-Agent COTR System for defense contracting
    Liebowitz, J
    Adya, M
    Rubenstein-Montano, B
    Yoon, V
    Buchwalter, J
    Imhoff, M
    Baek, S
    Suen, C
    KNOWLEDGE-BASED SYSTEMS, 2000, 13 (05) : 241 - 250
  • [47] Multi-agent based Model of DDoS Active Defense
    Zhang Ming-qing
    Liu Xiao-hu
    Cheng Jian
    Fan Tao
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 160 - 164
  • [48] A depth defense model of multi-agent based on immunity
    Jiang, Yaping
    Li, Tao
    Yang, Jin
    Wang, TieFang
    Zhou, Jianhua
    Liang, Gang
    Xu, ChunLin
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 178 - 185
  • [49] Defense Against Multi-target Trojan Attacks
    Harikumar, Haripriya
    Rana, Santu
    Do, Kien
    Gupta, Sunil
    Zong, Wei
    Susilo, Willy
    Venkatesh, Svetha
    arXiv, 2022,
  • [50] Resilient synchronization of distributed multi-agent systems under attacks
    Mustafa, Aquib
    Modares, Hamidreza
    Moghadam, Rohollah
    AUTOMATICA, 2020, 115