Synthesising Audio Adversarial Examples for Automatic Speech Recognition

被引:4
|
作者
Qu, Xinghua [1 ]
Wei, Pengfei [1 ]
Gao, Mingyong [2 ]
Sun, Zhu [3 ]
Ong, Yew-Soon [4 ]
Ma, Zejun [1 ]
机构
[1] Bytedance AI Lab, Speech & Audio Team, Singapore, Singapore
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] ASTAR, Inst High Performance Comp & Ctr Frontier AI Res, Singapore, Singapore
[4] Nanyang Technol Univ, A STAR, Ctr Frontier AI Res, Singapore, Singapore
关键词
Adversarial Attack; Automatic Speech Recognition; Speech Synthesis;
D O I
10.1145/3534678.3539268
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adversarial examples in automatic speech recognition (ASR) are naturally sounded by humans yet capable of fooling well trained ASR models to transcribe incorrectly. Existing audio adversarial examples are typically constructed by adding constrained perturbations on benign audio inputs. Such attacks are therefore generated with an audio dependent assumption. For the first time, we propose the Speech Synthesising based Attack (SSA), a novel threat model that constructs audio adversarial examples entirely from scratch, i.e., without depending on any existing audio to fool cutting-edge ASR models. To this end, we introduce a conditional variational auto-encoder (CVAE) as the speech synthesiser. Meanwhile, an adaptive sign gradient descent algorithm is proposed to solve the adversarial audio synthesis task. Experiments on three datasets (i.e., Audio Mnist, Common Voice, and Librispeech) show that our method could synthesise naturally sounded audio adversarial examples to mislead the start-of-the-art ASR models. Our web-page containing generated audio demos is at https://sites.google.com/view/ssa-asr/home.
引用
收藏
页码:1430 / 1440
页数:11
相关论文
共 50 条
  • [1] Towards Visualizing and Detecting Audio Adversarial Examples for Automatic Speech Recognition
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    INFORMATION SECURITY AND PRIVACY, ACISP 2021, 2021, 13083 : 531 - 549
  • [2] Noise Flooding for Detecting Audio Adversarial Examples Against Automatic Speech Recognition
    Rajaratnam, Krishan
    Kalita, Jugal
    2018 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2018, : 197 - 201
  • [3] A Detection Algorithm for Audio Adversarial Examples in EI-Enhanced Automatic Speech Recognition
    Huang, Ying
    Liu, Jie
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [4] Detecting Audio Adversarial Examples in Automatic Speech Recognition Systems Using Decision Boundary Patterns
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    Kim, Jongkil
    Le, Ngoc Thuy
    JOURNAL OF IMAGING, 2022, 8 (12)
  • [5] Adversarial Examples for Automatic Speech Recognition: Attacks and Countermeasures
    Hu, Shengshan
    Shang, Xingcan
    Qin, Zhan
    Li, Minghui
    Wang, Qian
    Wang, Cong
    IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (10) : 120 - 126
  • [6] Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition
    Qin, Yao
    Carlini, Nicholas
    Goodfellow, Ian
    Cottrell, Garrison
    Raffel, Colin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] AudioGuard: Speech Recognition System Robust against Optimized Audio Adversarial Examples
    Kwon, Hyun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (20) : 57943 - 57962
  • [8] Model Access Control Based on Hidden Adversarial Examples for Automatic Speech Recognition
    Chen H.
    Zhang J.
    Chen K.
    Zhang W.
    Yu N.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (03): : 1302 - 1315
  • [9] Acoustic-decoy: Detection of adversarial examples through audio modification on speech recognition system
    Kwon, Hyun
    Yoon, Hyunsoo
    Park, Ki-Woong
    NEUROCOMPUTING, 2020, 417 (417) : 357 - 370
  • [10] IMPERIO: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
    Schoenherr, Lea
    Eisenhofer, Thorsten
    Zeiler, Steffen
    Holz, Thorsten
    Kolossa, Dorothea
    36TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2020), 2020, : 843 - 855