Adversarial Attacks on Automatic Speech Recognition (ASR): A Survey

被引:1
|
作者
Bhanushali, Amisha Rajnikant [1 ]
Mun, Hyunjun [1 ]
Yun, Joobeom [1 ]
机构
[1] Sejong Univ, Dept Comp & Informat Secur & Convergence Engn Inte, Seoul 05006, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Surveys; Taxonomy; Automatic speech recognition; Task analysis; Internet; Text recognition; Adversarial machine learning; Artificial neural networks; Adversarial attacks; adversarial samples; automatic speech recognition (ASR); deep neural network (DNN); COEFFICIENTS; EXAMPLES;
D O I
10.1109/ACCESS.2024.3416965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Speech Recognition (ASR) systems have improved and eased how humans interact with devices. ASR system converts an acoustic waveform into the relevant text form. Modern ASR inculcates deep neural networks (DNNs) to provide faster and better results. As the use of DNN continues to expand, there is a need for examination against various adversarial attacks. Adversarial attacks are synthetic samples crafted carefully by adding particular noise to legitimate examples. They are imperceptible, yet they prove catastrophic to DNNs. Recently, adversarial attacks on ASRs have increased but previous surveys lack generalization of the different methods used for attacking ASR, and the scope of the study is narrowed to a particular application, making it difficult to determine the relationships and trade-offs between the attack techniques. Therefore, this survey provides a taxonomy illustrating the classification of the adversarial attacks on ASR based on their characteristics and behavior. Additionally, we have analyzed the existing methods for generating adversarial attacks and presented their comparative analysis. We have clearly drawn the outline to indicate the efficiency of the adversarial techniques, and based on the lacunae found in the existing studies, we have stated the future scope.
引用
收藏
页码:88279 / 88302
页数:24
相关论文
共 50 条
  • [31] Black-box adversarial attacks through speech distortion for speech emotion recognition
    Jinxing Gao
    Diqun Yan
    Mingyu Dong
    EURASIP Journal on Audio, Speech, and Music Processing, 2022
  • [32] Adversarial attacks and defenses in deep learning for image recognition: A survey
    Wang, Jia
    Wang, Chengyu
    Lin, Qiuzhen
    Luo, Chengwen
    Wu, Chao
    Li, Jianqiang
    NEUROCOMPUTING, 2022, 514 : 162 - 181
  • [33] CommanderUAP: a practical and transferable universal adversarial attacks on speech recognition models
    Sun, Zheng
    Zhao, Jinxiao
    Guo, Feng
    Chen, Yuxuan
    Ju, Lei
    CYBERSECURITY, 2024, 7 (01):
  • [34] A Systematic Evaluation of Adversarial Attacks against Speech Emotion Recognition Models
    Facchinetti, Nicolas
    Simonetta, Federico
    Ntalampiras, Stavros
    Intelligent Computing, 2024, 3
  • [35] Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition
    Zhang, Yike
    Zhang, Pengyuan
    Yan, Yonghong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3348 - 3352
  • [36] Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations
    Dhawan, Kunal
    Koluguri, Nithin Rao
    Jukic, Ante
    Langman, Ryan
    Balam, Jagadeesh
    Ginsburg, Boris
    INTERSPEECH 2024, 2024, : 2574 - 2578
  • [37] Classification Techniques for Automatic Speech Recognition (ASR) Algorithms used with Real Time Speech Translation
    Nasereddin, Hebah H. O.
    Omari, Ayoub Abdel Rahman
    2017 COMPUTING CONFERENCE, 2017, : 200 - 207
  • [38] Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition
    Qin, Yao
    Carlini, Nicholas
    Goodfellow, Ian
    Cottrell, Garrison
    Raffel, Colin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [39] Effective Adversarial Sample Detection for Securing Automatic Speech Recognition
    Lin, Chih-Yang
    Wang, Yan-Zhang
    Lin, Shou-Kuan
    Farady, Isack
    Jan, Yih-Kuen
    Lin, Wei-Yang
    2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, AVSS 2024, 2024,
  • [40] A survey of technologies for automatic Dysarthric speech recognition
    Qian, Zhaopeng
    Xiao, Kejing
    Yu, Chongchong
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)