Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

被引:5
|
作者
Ren, Yankun [1 ]
Lin, Jianbin [1 ]
Tang, Siliang [2 ]
Zhou, Jun [1 ]
Yang, Shuang [1 ]
Qi, Yuan [1 ]
Ren, Xiang [3 ]
机构
[1] Ant Financial Serv Grp, Hangzhou, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
[3] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
D O I
10.3233/FAIA200340
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Today text classification models have been widely used. However, these classifiers are found to be easily fooled by adversarial examples. Fortunately, standard attacking methods generate adversarial texts in a pair-wise way, that is, an adversarial text can only be created from a real-world text by replacing a few words. In many applications, these texts are limited in numbers, therefore their corresponding adversarial examples are often not diverse enough and sometimes hard to read, thus can be easily detected by humans and cannot create chaos at a large scale. In this paper, we propose an end to end solution to efficiently generate adversarial texts from scratch using generative models, which are not restricted to perturbing the given texts. We call it unrestricted adversarial text generation. Specifically, we train a conditional variational autoencoder (VAE) with an additional adversarial loss to guide the generation of adversarial examples. Moreover, to improve the validity of adversarial texts, we utilize discrimators and the training framework of generative adversarial networks (GANs) to make adversarial texts consistent with real data. Experimental results on sentiment analysis demonstrate the scalability and efficiency of our method. It can attack text classification models with a higher success rate than existing methods, and provide acceptable quality for humans in the meantime.
引用
收藏
页码:2156 / 2163
页数:8
相关论文
共 50 条
  • [21] Towards Generating Structurally Realistic Models by Generative Adversarial Networks
    Rahimi, Abbas
    Tisi, Massimo
    Rahimi, Shekoufeh Kolahdouz
    Berardinelli, Luca
    2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 597 - 604
  • [22] Robust adversarial examples against scale transformation via generative network
    Liu, Minjie
    Zhang, Xinpeng
    Feng, Guorui
    ELECTRONICS LETTERS, 2022, 58 (07) : 290 - 292
  • [23] Survey on Generating Adversarial Examples
    Pan W.-W.
    Wang X.-Y.
    Song M.-L.
    Chen C.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (01): : 67 - 81
  • [24] CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples
    Yu, Honggang
    Yang, Kaichen
    Zhang, Teng
    Tsai, Yun-Yun
    Ho, Tsung-Yi
    Jin, Yier
    27TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2020), 2020,
  • [25] Adversarial Attacks on Large Language Models
    Zou, Jing
    Zhang, Shungeng
    Qiu, Meikang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2024, 2024, 14887 : 85 - 96
  • [26] Generative Large Language Models Explained
    Yan, Xueming
    Xiao, Yan
    Jin, Yaochu
    IEEE Computational Intelligence Magazine, 2024, 19 (04) : 45 - 46
  • [27] Generating Adversarial Examples for Holding Robustness of Source Code Processing Models
    Zhang, Huangzhao
    Li, Zhuo
    Li, Ge
    Ma, Lei
    Liu, Yang
    Jinl, Zhi
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1169 - 1176
  • [28] Natural Adversarial Examples
    Hendrycks, Dan
    Zhao, Kevin
    Basart, Steven
    Steinhardt, Jacob
    Song, Dawn
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15257 - 15266
  • [29] Foundation Models, Generative AI, and Large Language Models
    Ross, Angela
    McGrow, Kathleen
    Zhi, Degui
    Rasmy, Laila
    CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 377 - 387
  • [30] Generative Adversarial Examples for Sequential Text Recognition Models with Artistic Text Style
    Liu, Yanhong
    Cao, Fengming
    Zhang, Yuqi
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 71 - 79