Attention-guided Adversarial Attack for Video Object Segmentation

Cited by: 0
Authors
Yao, Rui [1, 2]
Chen, Ying [1, 2]
Zhou, Yong [1, 2]
Hu, Fuyuan [3]
Zhao, Jiaqi [1]
Liu, Bing [1]
Shao, Zhiwen [1]
Affiliations
[1] China Univ Mining & Technol, Sch Comp Sci & Technol, 1 Daxue Rd, Xuzhou, Jiangsu, Peoples R China
[2] Minist Educ, Engn Res Ctr Mine Digitizat, 1 Daxue Rd, Xuzhou 221116, Jiangsu, Peoples R China
[3] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, 188 Renai Rd, Suzhou 215009, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video object segmentation; adversarial attack; attention-guided; deconvolution network
DOI
10.1145/3617067
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video Object Segmentation (VOS) methods have achieved many breakthroughs thanks to the continuing advance of deep learning. However, deep learning models are vulnerable to malicious adversarial attacks, which mislead a model into making wrong decisions by adding perturbations to the input image that humans cannot perceive. These threats to deep learning models indicate that video object segmentation methods are also vulnerable to attack, which threatens their security. We therefore study adversarial attacks on the VOS task to better identify the vulnerabilities of VOS methods, which in turn provides an opportunity to improve their robustness. In this paper, we propose an attention-guided adversarial attack method, which uses spatial attention blocks to capture features with global dependencies to construct correlations between consecutive video frames, and performs multipath aggregation to effectively integrate spatial-temporal perturbation, thereby guiding the deconvolution network to generate adversarial examples with strong attack capability. Specifically, a class loss function is designed so that, based on the enhanced feature map of the object class, the deconvolution network better activates noise in other regions and suppresses the activation related to the object class. At the same time, an attentional feature loss is designed to enhance the transferability of the attack. Experimental results on the DAVIS dataset show that the proposed attention-guided adversarial attack method significantly reduces the segmentation accuracy of OSVOS, with the J&F mean on DAVIS 2016 reaching a drop rate of 73.6%. The generated adversarial examples are also highly transferable to other video object segmentation models.
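The abstract reports attack strength as a drop rate of the J&F mean. As a rough illustration only, the sketch below computes the region similarity J (the Jaccard index between a predicted and a ground-truth mask) and a relative drop rate. The function names and the definition of drop rate as (clean - attacked) / clean are assumptions made for this example; the record does not state the exact formula the authors used, and the contour accuracy F is omitted.

```python
def region_similarity(pred, gt):
    """Jaccard index (J) between two binary masks given as nested lists of 0/1."""
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                inter += 1
            if p or g:
                union += 1
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


def drop_rate(clean_score, attacked_score):
    """Relative drop of a metric under attack (assumed definition)."""
    if clean_score <= 0:
        raise ValueError("clean_score must be positive")
    return (clean_score - attacked_score) / clean_score


if __name__ == "__main__":
    gt = [[1, 1], [0, 0]]
    clean_pred = [[1, 1], [0, 0]]     # hypothetical prediction on the clean frame
    attacked_pred = [[1, 0], [0, 1]]  # hypothetical prediction on the adversarial frame
    j_clean = region_similarity(clean_pred, gt)
    j_attacked = region_similarity(attacked_pred, gt)
    print(j_clean, j_attacked, drop_rate(j_clean, j_attacked))
```

Under this assumed definition, a drop rate of 73.6% would mean the attacked score is about 26.4% of the clean score.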
Pages: 22
Related papers
50 items in total
  • [1] Attention-Guided Memory Model for Video Object Segmentation
    Lin, Yunjian
    Tan, Yihua
    Communications in Computer and Information Science, 2022, 1566 CCIS : 67 - 85
  • [2] Attention-Guided Network for Semantic Video Segmentation
    Li, Jiangyun
    Zhao, Yikai
    Fu, Jun
    Wu, Jiajia
    Liu, Jing
    IEEE ACCESS, 2019, 7 : 140680 - 140689
  • [3] Attention-guided Temporally Coherent Video Object Matting
    Zhang, Yunke
    Wang, Chi
    Cui, Miaomiao
    Ren, Peiran
    Xie, Xuansong
    Hua, Xian-Sheng
    Bao, Hujun
    Huang, Qixing
    Xu, Weiwei
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021 : 5128 - 5137
  • [4] Video Sparse Transformer With Attention-Guided Memory for Video Object Detection
    Fujitake, Masato
    Sugimoto, Akihiro
    IEEE ACCESS, 2022, 10 : 65886 - 65900
  • [5] Attention-Guided Disentangled Feature Aggregation for Video Object Detection
    Muralidhara, Shishir
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    SENSORS, 2022, 22 (21)
  • [6] Visual Attention Guided Video Object Segmentation
    Liang, Hao
    Tan, Yihua
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019 : 345 - 349
  • [7] AOSVSSNet: Attention-Guided Optical Satellite Video Smoke Segmentation Network
    Wang, Taoyang
    Hong, Jianzhi
    Han, Yuqi
    Zhang, Guo
    Chen, Shili
    Dong, Tiancheng
    Yang, Yapeng
    Ruan, Hang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8552 - 8566
  • [8] Guided Slot Attention for Unsupervised Video Object Segmentation
    Lee, Minhyeok
    Cho, Suhwan
    Lee, Dogyoon
    Park, Chaewon
    Lee, Jungho
    Lee, Sangyoun
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024 : 3807 - 3816
  • [9] Attention-guided transformation-invariant attack for black-box adversarial examples
    Zhu, Jiaqi
    Dai, Feng
    Yu, Lingyun
    Xie, Hongtao
    Wang, Lidong
    Wu, Bo
    Zhang, Yongdong
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (05) : 3142 - 3165
  • [10] Attention-guided Unified Network for Panoptic Segmentation
    Li, Yanwei
    Chen, Xinze
    Zhu, Zheng
    Xie, Lingxi
    Huang, Guan
    Du, Dalong
    Wang, Xingang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019 : 7019 - 7028