PAT: Geometry-Aware Hard-Label Black-Box Adversarial Attacks on Text

被引：1

作者：

Ye, Muchao ^{[1
]}

Chen, Jinghui ^{[1
]}

Miao, Chenglin ^{[2
]}

Liu, Han ^{[3
]}

Wang, Ting ^{[1
]}

Ma, Fenglong ^{[1
]}

机构：

[1] Penn State Univ, University Pk, PA 16802 USA

[2] Iowa State Univ, Ames, IA USA

[3] Dalian Univ Technol, Dalian, Liaoning, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023年

基金：

美国国家科学基金会;

关键词：

hard-label adversarial attack; robustness of language model;

D O I：

10.1145/3580305.3599461

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Despite a plethora of prior explorations, conducting text adversarial attacks in practical settings is still challenging with the following constraints: black box - the inner structure of the victim model is unknown; hard label - the attacker only has access to the top-1 prediction results; and semantic preservation - the perturbation needs to preserve the original semantics. In this paper, we present PAT,1 a novel adversarial attack method employed under all these constraints. Specifically, PAT explicitly models the adversarial and non-adversarial prototypes and incorporates them to measure semantic changes for replacement selection in the hard-label black-box setting to generate high-quality samples. In each iteration, PAT finds original words that can be replaced back and selects better candidate words for perturbed positions in a geometry-aware manner guided by this estimation, which maximally improves the perturbation construction and minimally impacts the original semantics. Extensive evaluation with benchmark datasets and state-of-the-art models shows that PAT outperforms existing text adversarial attacks in terms of both attack effectiveness and semantic preservation. Moreover, we validate the efficacy of PAT against industry-leading natural language processing platforms in real-world settings.

引用

页码：3093 / 3104

页数：12

共 50 条

[41] Black-box attacks against log anomaly detection with adversarial examples
Lu, Siyang
Wang, Mingquan
Wang, Dongdong
Wei, Xiang
Xiao, Sizhe
Wang, Zhiwei
Han, Ningning
Wang, Liqiang
INFORMATION SCIENCES, 2023, 619 : 249 - 262
[42] Efficient Local Imperceptible Random Search for Black-Box Adversarial Attacks
Li, Yining
You, Shu
Chen, Yihan
Li, Zhenhua
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 325 - 336
[43] Less is More: Dimension Reduction Finds On-Manifold Adversarial Examples in Hard-Label Attacks
Garcia, Washington
Chen, Pin-Yu
Clouse, Hamilton Scott
Jha, Somesh
Butler, Kevin R. B.
2023 IEEE CONFERENCE ON SECURE AND TRUSTWORTHY MACHINE LEARNING, SATML, 2023, : 254 - 270
[44] Black-box attacks on dynamic graphs via adversarial topology perturbations
Tao, Haicheng
Cao, Jie
Chen, Lei
Sun, Hongliang
Shi, Yong
Zhu, Xingquan
NEURAL NETWORKS, 2024, 171 : 308 - 319
[45] Improving the transferability of adversarial examples through black-box feature attacks
Wang, Maoyuan
Wang, Jinwei
Ma, Bin
Luo, Xiangyang
NEUROCOMPUTING, 2024, 595
[46] Black-box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples
Zhang, Yuekai
Jiang, Ziyan
Villalba, Jesus
Dehak, Najim
INTERSPEECH 2020, 2020, : 4238 - 4242
[47] Adversarial Black-Box Attacks with Timing Side-Channel Leakage
Nakai, Tsunato
Suzuki, Daisuke
Omatsu, Fumio
Fujino, Takeshi
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2021, E104A (01) : 143 - 151
[48] Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information
Zhene, Baolin
Jiang, Peipei
Wang, Qian
Li, Qi
Shen, Chao
Wang, Cong
Ge, Yunjie
Teng, Qingyang
Zhang, Shenyi
CCS '21: PROCEEDINGS OF THE 2021 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 86 - 107
[49] Mitigating Black-Box Adversarial Attacks via Output Noise Perturbation
Aithal, Manjushree B.
Li, Xiaohua
IEEE ACCESS, 2022, 10 : 12395 - 12411
[50] Simultaneously Optimizing Perturbations and Positions for Black-Box Adversarial Patch Attacks
Wei, Xingxing
Guo, Ying
Yu, Jie
Zhang, Bo
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 9041 - 9054

← 1 2 3 4 5 →