Towards Decision-based Sparse Attacks on Video Recognition

被引：1

作者：

Jiang, Kaixun ^{[1
]}

Chen, Zhaoyu ^{[1
]}

Zhou, Xinyu ^{[2
]}

Zhang, Jingyu ^{[1
]}

Hong, Lingyi ^{[2
]}

Li, Bo ^{[2
,3
]}

Wang, Yan ^{[1
]}

Zhang, Wenqiang ^{[1
]}

机构：

[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

[3] Vivo Mobile Commun Co Ltd, Dongguan, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

adversarial examples; video action recognition; sparse attacks;

D O I：

10.1145/3581783.3611828

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent studies indicate that sparse attacks threaten the security of deep learning models, which modify only a small set of pixels in the input based on the l(0) norm constraint. While existing research has primarily focused on sparse attacks against image models, there is a notable gap in evaluating the robustness of video recognition models. To bridge this gap, we are the first to study sparse video attacks and propose an attack framework named V-DSA in the most challenging decision-based setting, in which threat models only return the predicted hard label. Specifically, V-DSA comprises two modules: a Cross-Modal Generator (CMG) for query-free transfer attacks on each frame and an Optical flow Grouping Evolution algorithm (OGE) for query-efficient spatial-temporal attacks. CMG passes each frame to generate the transfer video as the starting point of the attack based on the feature similarity between image classification and video recognition models. OGE first initializes populations based on transfer video and then leverages optical flow to establish the temporal connection of the perturbed pixels in each frame, which can reduce the parameter space and break the temporal relationship between frames specifically. Finally, OGE complements the above optical flow modeling by grouping evolution which can realize the coarse-to-fine attack to avoid falling into the local optimum. In addition, OGE makes the perturbation with temporal coherence while balancing the number of perturbed pixels per frame, further increasing the imperceptibility of the attack. Extensive experiments demonstrate that V-DSA achieves state-of-the-art performance in terms of both threat effectiveness and imperceptibility. We hope V-DSA can provide valuable insights into the security of video recognition systems.

引用

页码：1443 / 1454

页数：12

共 50 条

[1] Efficient Decision-based Black-box Patch Attacks on Video Recognition
Jiang, Kaixun
Chen, Zhaoyu
Huang, Hao
Wang, Jiafeng
Yang, Dingkang
Li, Bo
Wang, Yan
Zhang, Wenqiang
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4356 - 4366
[2] Efficient Decision-based Black-box Adversarial Attacks on Face Recognition
Dong, Yinpeng
Su, Hang
Wu, Baoyuan
Li, Zhifeng
Liu, Wei
Zhang, Tong
Zhu, Jun
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7706 - 7714
[3] Low-Rank and Sparse Decomposition for Low-Query Decision-Based Adversarial Attacks
Esmaeili, Ashkan
Edraki, Marzieh
Rahnavard, Nazanin
Mian, Ajmal
Shah, Mubarak
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 1561 - 1575
[4] Decision-based evasion attacks on tree ensemble classifiers
Zhang, Fuyong
Wang, Yi
Liu, Shigang
Wang, Hua
[J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (05): : 2957 - 2977
[5] Decision-based evasion attacks on tree ensemble classifiers
Fuyong Zhang
Yi Wang
Shigang Liu
Hua Wang
[J]. World Wide Web, 2020, 23 : 2957 - 2977
[6] AutoDA: Automated Decision-based Iterative Adversarial Attacks
Fu, Qi-An
Dong, Yinpeng
Su, Hang
Zhu, Jun
Zhang, Chao
[J]. PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 3557 - 3574
[7] Visual tracking via decision-based particle filtering based on sparse representation
Farahani, Mohamad Hosein Davoodabadi
Lotfizad, Mojtaba
[J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (04)
[8] Face recognition/detection by probabilistic decision-based neural network
Lin, SH
Kung, SY
Lin, LJ
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (01): : 114 - 132
[9] Enhancing robustness in video recognition models: Sparse adversarial attacks and beyond
Mu, Ronghui
Marcolino, Leandro
Ni, Qiang
Ruan, Wenjie
[J]. NEURAL NETWORKS, 2024, 171 : 127 - 143
[10] Texture recognition by generalized probabilistic decision-based neural networks
Xu, Yeong-Yuh
Tseng, C. -L.
Fu, Hsin-Chia
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 6184 - 6189

← 1 2 3 4 5 →