BFS2Adv: Black-box adversarial attack towards hard-to-attack short texts

Cited: 0
Authors
Han, Xu [1 ]
Li, Qiang [1 ]
Cao, Hongbo [1 ]
Han, Lei [2 ]
Wang, Bin [3 ]
Bao, Xuhua [4 ]
Han, Yufei [5 ]
Wang, Wei [1 ]
Affiliations
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Transp, Beijing 100044, Peoples R China
[2] Beijing Inst Comp Technol & Applicat, Beijing 100584, Peoples R China
[3] Zhejiang Univ, Zhejiang Key Lab Multidimens Percept Technol Appli, Hangzhou 310027, Peoples R China
[4] Sangfor Technol Inc, Shenzhen 518055, Peoples R China
[5] INRIA, F-35042 Rennes, France
Funding
National Key R&D Program of China; National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Text classification; Adversarial attack; Score-based adversarial attack; Hard-to-attack examples;
DOI
10.1016/j.cose.2024.103817
CLC Number
TP [Automation technology, computer technology];
Discipline Code
0812 ;
Abstract
The advent of Machine Learning as a Service (MLaaS) and deep learning applications has increased the susceptibility of models to adversarial textual attacks, particularly in black-box settings. Prior work on black-box adversarial textual attacks generally follows a stable strategy that involves leveraging char-level, word-level, and sentence-level perturbations, as well as using queries to the target model to find adversarial examples in the search space. However, existing approaches prioritize query efficiency by reducing the search space, thereby overlooking hard-to-attack textual instances. To address this issue, we propose BFS2Adv, a brute-force algorithm that generates adversarial examples for both easy-to-attack and hard-to-attack textual inputs. BFS2Adv, starting with an original text, employs word-level perturbations and synonym substitution to construct a comprehensive search space, with each node representing a potential adversarial example. The algorithm systematically explores this space through a breadth-first search, combined with queries to the target model, to effectively identify qualified adversarial examples. We implemented and evaluated a prototype of BFS2Adv against renowned models such as ALBERT and BERT, utilizing the SNLI and MR datasets. Our results demonstrate that BFS2Adv outperforms state-of-the-art algorithms and effectively improves the success rate of short-text adversarial attacks. Furthermore, we provide detailed insights into the robustness of BFS2Adv by analyzing those hard-to-attack examples.
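The search procedure the abstract describes (a breadth-first traversal of a synonym-substitution space, querying the black-box model at each node until a label flip is found) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the `synonyms` dictionary, the `predict` callback, and the `max_queries` budget are assumed interfaces introduced here for clarity.

```python
from collections import deque

def bfs_attack(words, synonyms, predict, true_label, max_queries=1000):
    """Breadth-first search for an adversarial text (illustrative sketch).

    words:      list of tokens of the original text
    synonyms:   dict mapping a token to its candidate substitutes
    predict:    black-box classifier, list-of-tokens -> label (one query)
    true_label: label of the original text; any other label is adversarial
    Returns (adversarial_tokens or None, number_of_queries_used).
    """
    root = tuple(words)
    seen = {root}              # avoid re-visiting the same candidate node
    frontier = deque([root])   # FIFO queue gives breadth-first order
    queries = 0
    while frontier and queries < max_queries:
        current = frontier.popleft()
        # Expand: word-level perturbation, one synonym substitution per child.
        for i, w in enumerate(current):
            for sub in synonyms.get(w, []):
                candidate = current[:i] + (sub,) + current[i + 1:]
                if candidate in seen:
                    continue
                seen.add(candidate)
                queries += 1
                if predict(list(candidate)) != true_label:
                    return list(candidate), queries  # qualified adversarial example
                frontier.append(candidate)
                if queries >= max_queries:
                    return None, queries
    return None, queries
```

Because the frontier is expanded level by level, candidates closer to the original text (fewer substituted words) are tried first, so the first success is also a minimally perturbed one; the exhaustive expansion is what lets the method reach hard-to-attack inputs that pruned search strategies miss, at the cost of more queries.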
Pages: 12