Adversarial Attacks on Automatic Speech Recognition (ASR): A Survey

Cited by: 1
Authors
Bhanushali, Amisha Rajnikant [1]
Mun, Hyunjun [1]
Yun, Joobeom [1]
Affiliations
[1] Sejong Univ, Dept Comp & Informat Secur & Convergence Engn Inte, Seoul 05006, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Surveys; Taxonomy; Automatic speech recognition; Task analysis; Internet; Text recognition; Adversarial machine learning; Artificial neural networks; Adversarial attacks; adversarial samples; automatic speech recognition (ASR); deep neural network (DNN); COEFFICIENTS; EXAMPLES;
DOI
10.1109/ACCESS.2024.3416965
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Automatic Speech Recognition (ASR) systems have improved and eased how humans interact with devices. An ASR system converts an acoustic waveform into the corresponding text. Modern ASR systems incorporate deep neural networks (DNNs) to provide faster and better results. As the use of DNNs continues to expand, their robustness against various adversarial attacks needs to be examined. Adversarial attacks rely on synthetic samples carefully crafted by adding particular noise to legitimate examples. The perturbations are imperceptible, yet they prove catastrophic to DNNs. Adversarial attacks on ASR systems have increased recently, but previous surveys do not generalize across the different methods used for attacking ASR, and their scope is narrowed to a particular application, making it difficult to determine the relationships and trade-offs between attack techniques. Therefore, this survey provides a taxonomy that classifies adversarial attacks on ASR based on their characteristics and behavior. Additionally, we analyze the existing methods for generating adversarial attacks and present a comparative analysis of them. We outline the efficiency of the adversarial techniques and, based on the gaps found in existing studies, state directions for future work.
Pages: 88279 - 88302
Page count: 24
Related papers
  • [1] Survey of adversarial attacks on speech recognition
    He Y.
    Hu M.
    Peng Z.
    Deng X.
    Liu S.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (02) : 10 - 18
  • [2] Blackbox Adversarial Attacks and Explanations for Automatic Speech Recognition
    Wu, Xiaoliang
    PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022, 2022 : 1765 - 1769
  • [3] Adversarial Examples for Automatic Speech Recognition: Attacks and Countermeasures
    Hu, Shengshan
    Shang, Xingcan
    Qin, Zhan
    Li, Minghui
    Wang, Qian
    Wang, Cong
    IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (10) : 120 - 126
  • [4] Adversarial Attacks Against Automatic Speech Recognition Systems via Psychoacoustic Hiding
    Schoenherr, Lea
    Kohls, Katharina
    Zeiler, Steffen
    Holz, Thorsten
    Kolossa, Dorothea
    26TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2019), 2019
  • [5] Feature extraction for automatic speech recognition (ASR)
    Swartz, B
    Magotra, N
    THIRTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1997 : 748 - 751
  • [6] DETECTING ADVERSARIAL ATTACKS ON AUDIOVISUAL SPEECH RECOGNITION
    Ma, Pingchuan
    Petridis, Stavros
    Pantic, Maja
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021 : 6403 - 6407
  • [7] Towards Query-Efficient Adversarial Attacks Against Automatic Speech Recognition Systems
    Wang, Qian
    Zheng, Baolin
    Li, Qi
    Shen, Chao
    Ba, Zhongjie
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 896 - 908
  • [8] Query-Efficient Black-Box Adversarial Attacks on Automatic Speech Recognition
    Tong, Chuxuan
    Zheng, Xi
    Li, Jianhua
    Ma, Xingjun
    Gao, Longxiang
    Xiang, Yong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3981 - 3992
  • [9] Automatic speech recognition: a survey
    Malik, Mishaim
    Malik, Muhammad Kamran
    Mehmood, Khawar
    Makhdoom, Imran
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 9411 - 9457