Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models

被引:0
|
作者
Huang, Yihao [1 ]
Juefei-Xu, Felix [2 ]
Guo, Qing [3 ,4 ]
Zhang, Jie [1 ]
Wu, Yutong [1 ]
Hu, Ming [1 ]
Li, Tianlin [1 ]
Pu, Geguang [5 ]
Liu, Yang [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] New York Univ, New York, NY USA
[3] Agcy Sci Technol & Res STAR, IHPC, Singapore, Singapore
[4] CFAR, Singapore, Singapore
[5] East China Normal Univ, Shanghai, Peoples R China
基金
新加坡国家研究基金会;
关键词
ADVERSARIAL ROBUSTNESS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although recent personalization methods have democratized high-resolution image synthesis by enabling swift concept acquisition with minimal examples and lightweight computation, they also present an exploitable avenue for highly accessible backdoor attacks. This paper investigates a critical and unexplored aspect of text-to-image (T2I) diffusion models their potential vulnerability to backdoor attacks via personalization. By studying the prompt processing of popular personalization methods (epitomized by Textual Inversion and DreamBooth), we have devised dedicated personalization-based backdoor attacks according to the different ways of dealing with unseen tokens and divide them into two families: nouveau-token and legacy-token backdoor attacks. In comparison to conventional backdoor attacks involving the fine-tuning of the entire text-to-image diffusion model, our proposed personalization-based backdoor attack method can facilitate more tailored, efficient, and few-shot attacks. Through comprehensive empirical study, we endorse the utilization of the nouveau-token backdoor attack due to its impressive effectiveness, stealthiness, and integrity, markedly outperforming the legacy-token backdoor attack.
引用
收藏
页码:21169 / 21178
页数:10
相关论文
共 50 条
  • [1] BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models
    Vice, Jordan
    Akhtar, Naveed
    Hartley, Richard
    Mian, Ajmal
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 4865 - 4880
  • [2] Prompt suffix-attack against text-to-image diffusion models
    Xiong, Siyun
    Du, Yanhui
    Wang, Zhuohao
    Sun, Peiqi
    NEUROCOMPUTING, 2025, 630
  • [3] Optimizing Prompts Using In-Context Few-Shot Learning for Text-to-Image Generative Models
    Lee, Seunghun
    Lee, Jihoon
    Bae, Chan Ho
    Choi, Myung-Seok
    Lee, Ryong
    Ahn, Sangtae
    IEEE ACCESS, 2024, 12 : 2660 - 2673
  • [4] Text-to-Image Diffusion Models are Zero-Shot Classifiers
    Clark, Kevin
    Jaini, Priyank
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Ambiguity attack against text-to-image diffusion model watermarking
    Yuan, Zihan
    Li, Li
    Wang, Zichi
    Zhang, Xinpeng
    SIGNAL PROCESSING, 2024, 221
  • [6] HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
    Ruiz, Nataniel
    Li, Yuanzhen
    Jampani, Varun
    Wei, Wei
    Hou, Tingbo
    Pritch, Yael
    Wadhwa, Neal
    Rubinstein, Michael
    Aberman, Kfir
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 6527 - 6536
  • [7] Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks
    Xi, Zhaohan
    Du, Tianyu
    Li, Changjiang
    Pang, Ren
    Ji, Shouling
    Chen, Jinghui
    Ma, Fenglong
    Wang, Ting
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Debiasing Text-to-Image Diffusion Models
    He, Ruifei
    Xue, Chuhui
    Tan, Haoru
    Zhang, Wenqing
    Yu, Yingchen
    Bai, Song
    Qi, Xiaojuan
    PROCEEDINGS OF THE 1ST ACM MULTIMEDIA WORKSHOP ON MULTI-MODAL MISINFORMATION GOVERNANCE IN THE ERA OF FOUNDATION MODELS, MIS 2024, 2024, : 29 - 36
  • [9] Zero-shot spatial layout conditioning for text-to-image diffusion models
    Couairon, Guillaume
    Careil, Marlene
    Cord, Matthieu
    Lathuiliere, Stephane
    Verbeek, Jakob
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2174 - 2183
  • [10] Few-shot biomedical image segmentation using diffusion models: Beyond image generation
    Khosravi, Bardia
    Rouzrokh, Pouria
    Mickley, John P.
    Faghani, Shahriar
    Mulford, Kellen
    Yang, Linjun
    Larson, A. Noelle
    Howe, Benjamin M.
    Erickson, Bradley J.
    Taunton, Michael J.
    Wyles, Cody C.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242