Projan: A probabilistic trojan attack on deep neural networks

被引:0
|
作者
Saremi, Mehrin [1 ]
Khalooei, Mohammad [2 ]
Rastgoo, Razieh [3 ]
Sabokrou, Mohammad [4 ,5 ]
机构
[1] Semnan University, Farzanegan Campus, Semnan,35131-19111, Iran
[2] Amirkabir University of Technology, Department of Computer Engineering, Tehran, Iran
[3] Faculty of Electrical and Computer Engineering, Semnan University, Semnan,35131-19111, Iran
[4] Institute for Research in Fundamental Sciences, Tehran, Iran
[5] Okinawa Institute of Science and Technology, Okinawa, Japan
关键词
D O I
10.1016/j.knosys.2024.112565
中图分类号
学科分类号
摘要
Deep neural networks have gained popularity due to their outstanding performance across various domains. However, because of their lack of explainability, they are vulnerable to some kinds of threats including the trojan or backdoor attack, in which an adversary can train the model to respond to a crafted peculiar input pattern (also called trigger) according to their will. Several trojan attack and defense methods have been proposed in the literature. Many of the defense methods are based on the assumption that the possibly existing trigger must be able to affect the model's behavior, making it output a certain class label for all inputs. In this work, we propose an alternative attack method that violates this assumption. Instead of a single trigger that works on all inputs, a few triggers are generated that will affect only some of the inputs. At attack time, the adversary will need to try more than one trigger to succeed, which might be possible in some real-world situations. Our experiments on MNIST and CIFAR-10 datasets show that such an attack can be implemented successfully, reaching an attack success rate similar to baseline methods called BadNet and N-to-One. We also tested wide range of defense methods and verified that in general, this kind of backdoor is more difficult for defense algorithms to detect. The code is available at https://github.com/programehr/Projan. © 2024 Elsevier B.V.
引用
下载
收藏
相关论文
共 50 条
  • [1] An Embarrassingly Simple Approach for Trojan Attack in Deep Neural Networks
    Tang, Ruixiang
    Du, Mengnan
    Liu, Ninghao
    Yang, Fan
    Hu, Xia
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 218 - 228
  • [2] Deep Feature Space Trojan Attack of Neural Networks by Controlled Detoxification
    Cheng, Siyuan
    Liu, Yingqi
    Ma, Shiqing
    Zhang, Xiangyu
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1148 - 1156
  • [3] Amplification trojan network: Attack deep neural networks by amplifying their inherent weakness
    Hu, Zhanhao
    Zhu, Jun
    Zhang, Bo
    Hu, Xiaolin
    NEUROCOMPUTING, 2022, 505 : 142 - 153
  • [4] Live Trojan Attacks on Deep Neural Networks
    Costales, Robby
    Mao, Chengzhi
    Norwitz, Raphael
    Kim, Bryan
    Yang, Junfeng
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3460 - 3469
  • [5] Trojan Attacks and Defenses on Deep Neural Networks
    Liu, Yingqi
    ProQuest Dissertations and Theses Global, 2022,
  • [6] Attack on Deep Steganalysis Neural Networks
    Li, Shiyu
    Ye, Dengpan
    Jiang, Shunzhi
    Liu, Changrui
    Niu, Xiaoguang
    Luo, Xiangyang
    CLOUD COMPUTING AND SECURITY, PT IV, 2018, 11066 : 265 - 276
  • [7] Probabilistic Models with Deep Neural Networks
    Masegosa, Andres R.
    Cabanas, Rafael
    Langseth, Helge
    Nielsen, Thomas D.
    Salmeron, Antonio
    ENTROPY, 2021, 23 (01) : 1 - 27
  • [8] Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips
    Bai, Jiawang
    Gao, Kuofeng
    Gong, Dihong
    Xia, Shu-Tao
    Li, Zhifeng
    Liu, Wei
    arXiv, 2022,
  • [9] Hardly Perceptible Trojan Attack Against Neural Networks with Bit Flips
    Bai, Jiawang
    Gao, Kuofeng
    Gong, Dihong
    Xia, Shu-Tao
    Li, Zhifeng
    Liu, Wei
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 104 - 121
  • [10] Shallow Neural Networks to Deep Neural Networks for Probabilistic Wind Forecasting
    Arora, Parul
    Panigrahi, B. K.
    Suganthan, P. N.
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 377 - 382