Towards Decision-based Sparse Attacks on Video Recognition

被引:1
|
作者
Jiang, Kaixun [1 ]
Chen, Zhaoyu [1 ]
Zhou, Xinyu [2 ]
Zhang, Jingyu [1 ]
Hong, Lingyi [2 ]
Li, Bo [2 ,3 ]
Wang, Yan [1 ]
Zhang, Wenqiang [1 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Vivo Mobile Commun Co Ltd, Dongguan, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
adversarial examples; video action recognition; sparse attacks;
D O I
10.1145/3581783.3611828
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies indicate that sparse attacks threaten the security of deep learning models, which modify only a small set of pixels in the input based on the l(0) norm constraint. While existing research has primarily focused on sparse attacks against image models, there is a notable gap in evaluating the robustness of video recognition models. To bridge this gap, we are the first to study sparse video attacks and propose an attack framework named V-DSA in the most challenging decision-based setting, in which threat models only return the predicted hard label. Specifically, V-DSA comprises two modules: a Cross-Modal Generator (CMG) for query-free transfer attacks on each frame and an Optical flow Grouping Evolution algorithm (OGE) for query-efficient spatial-temporal attacks. CMG passes each frame to generate the transfer video as the starting point of the attack based on the feature similarity between image classification and video recognition models. OGE first initializes populations based on transfer video and then leverages optical flow to establish the temporal connection of the perturbed pixels in each frame, which can reduce the parameter space and break the temporal relationship between frames specifically. Finally, OGE complements the above optical flow modeling by grouping evolution which can realize the coarse-to-fine attack to avoid falling into the local optimum. In addition, OGE makes the perturbation with temporal coherence while balancing the number of perturbed pixels per frame, further increasing the imperceptibility of the attack. Extensive experiments demonstrate that V-DSA achieves state-of-the-art performance in terms of both threat effectiveness and imperceptibility. We hope V-DSA can provide valuable insights into the security of video recognition systems.
引用
收藏
页码:1443 / 1454
页数:12
相关论文
共 50 条
  • [1] Efficient Decision-based Black-box Patch Attacks on Video Recognition
    Jiang, Kaixun
    Chen, Zhaoyu
    Huang, Hao
    Wang, Jiafeng
    Yang, Dingkang
    Li, Bo
    Wang, Yan
    Zhang, Wenqiang
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4356 - 4366
  • [2] Efficient Decision-based Black-box Adversarial Attacks on Face Recognition
    Dong, Yinpeng
    Su, Hang
    Wu, Baoyuan
    Li, Zhifeng
    Liu, Wei
    Zhang, Tong
    Zhu, Jun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7706 - 7714
  • [3] Low-Rank and Sparse Decomposition for Low-Query Decision-Based Adversarial Attacks
    Esmaeili, Ashkan
    Edraki, Marzieh
    Rahnavard, Nazanin
    Mian, Ajmal
    Shah, Mubarak
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 1561 - 1575
  • [4] Decision-based evasion attacks on tree ensemble classifiers
    Zhang, Fuyong
    Wang, Yi
    Liu, Shigang
    Wang, Hua
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (05): : 2957 - 2977
  • [5] Decision-based evasion attacks on tree ensemble classifiers
    Fuyong Zhang
    Yi Wang
    Shigang Liu
    Hua Wang
    [J]. World Wide Web, 2020, 23 : 2957 - 2977
  • [6] AutoDA: Automated Decision-based Iterative Adversarial Attacks
    Fu, Qi-An
    Dong, Yinpeng
    Su, Hang
    Zhu, Jun
    Zhang, Chao
    [J]. PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 3557 - 3574
  • [7] Visual tracking via decision-based particle filtering based on sparse representation
    Farahani, Mohamad Hosein Davoodabadi
    Lotfizad, Mojtaba
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (04)
  • [8] Face recognition/detection by probabilistic decision-based neural network
    Lin, SH
    Kung, SY
    Lin, LJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (01): : 114 - 132
  • [9] Enhancing robustness in video recognition models: Sparse adversarial attacks and beyond
    Mu, Ronghui
    Marcolino, Leandro
    Ni, Qiang
    Ruan, Wenjie
    [J]. NEURAL NETWORKS, 2024, 171 : 127 - 143
  • [10] Texture recognition by generalized probabilistic decision-based neural networks
    Xu, Yeong-Yuh
    Tseng, C. -L.
    Fu, Hsin-Chia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 6184 - 6189