Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

被引：5

作者：

Ren, Yankun ^{[1
]}

Lin, Jianbin ^{[1
]}

Tang, Siliang ^{[2
]}

Zhou, Jun ^{[1
]}

Yang, Shuang ^{[1
]}

Qi, Yuan ^{[1
]}

Ren, Xiang ^{[3
]}

机构：

[1] Ant Financial Serv Grp, Hangzhou, Peoples R China

[2] Zhejiang Univ, Hangzhou, Peoples R China

[3] Univ Southern Calif, Los Angeles, CA 90007 USA

来源：

ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2020年 / 325卷

关键词：

D O I：

10.3233/FAIA200340

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Today text classification models have been widely used. However, these classifiers are found to be easily fooled by adversarial examples. Fortunately, standard attacking methods generate adversarial texts in a pair-wise way, that is, an adversarial text can only be created from a real-world text by replacing a few words. In many applications, these texts are limited in numbers, therefore their corresponding adversarial examples are often not diverse enough and sometimes hard to read, thus can be easily detected by humans and cannot create chaos at a large scale. In this paper, we propose an end to end solution to efficiently generate adversarial texts from scratch using generative models, which are not restricted to perturbing the given texts. We call it unrestricted adversarial text generation. Specifically, we train a conditional variational autoencoder (VAE) with an additional adversarial loss to guide the generation of adversarial examples. Moreover, to improve the validity of adversarial texts, we utilize discrimators and the training framework of generative adversarial networks (GANs) to make adversarial texts consistent with real data. Experimental results on sentiment analysis demonstrate the scalability and efficiency of our method. It can attack text classification models with a higher success rate than existing methods, and provide acceptable quality for humans in the meantime.

引用

页码：2156 / 2163

页数：8

共 50 条

[21] Towards Generating Structurally Realistic Models by Generative Adversarial Networks
Rahimi, Abbas
Tisi, Massimo
Rahimi, Shekoufeh Kolahdouz
Berardinelli, Luca
2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 597 - 604
[22] Robust adversarial examples against scale transformation via generative network
Liu, Minjie
Zhang, Xinpeng
Feng, Guorui
ELECTRONICS LETTERS, 2022, 58 (07) : 290 - 292
[23] Survey on Generating Adversarial Examples
Pan W.-W.
Wang X.-Y.
Song M.-L.
Chen C.
Ruan Jian Xue Bao/Journal of Software, 2020, 31 (01): : 67 - 81
[24] CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples
Yu, Honggang
Yang, Kaichen
Zhang, Teng
Tsai, Yun-Yun
Ho, Tsung-Yi
Jin, Yier
27TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2020), 2020,
[25] Adversarial Attacks on Large Language Models
Zou, Jing
Zhang, Shungeng
Qiu, Meikang
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2024, 2024, 14887 : 85 - 96
[26] Generative Large Language Models Explained
Yan, Xueming
Xiao, Yan
Jin, Yaochu
IEEE Computational Intelligence Magazine, 2024, 19 (04) : 45 - 46
[27] Generating Adversarial Examples for Holding Robustness of Source Code Processing Models
Zhang, Huangzhao
Li, Zhuo
Li, Ge
Ma, Lei
Liu, Yang
Jinl, Zhi
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1169 - 1176
[28] Natural Adversarial Examples
Hendrycks, Dan
Zhao, Kevin
Basart, Steven
Steinhardt, Jacob
Song, Dawn
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15257 - 15266
[29] Foundation Models, Generative AI, and Large Language Models
Ross, Angela
McGrow, Kathleen
Zhi, Degui
Rasmy, Laila
CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 377 - 387
[30] Generative Adversarial Examples for Sequential Text Recognition Models with Artistic Text Style
Liu, Yanhong
Cao, Fengming
Zhang, Yuqi
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 71 - 79

← 1 2 3 4 5 →