Adversarial Attacks on Automatic Speech Recognition (ASR): A Survey

被引:1
|
作者
Bhanushali, Amisha Rajnikant [1 ]
Mun, Hyunjun [1 ]
Yun, Joobeom [1 ]
机构
[1] Sejong Univ, Dept Comp & Informat Secur & Convergence Engn Inte, Seoul 05006, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Surveys; Taxonomy; Automatic speech recognition; Task analysis; Internet; Text recognition; Adversarial machine learning; Artificial neural networks; Adversarial attacks; adversarial samples; automatic speech recognition (ASR); deep neural network (DNN); COEFFICIENTS; EXAMPLES;
D O I
10.1109/ACCESS.2024.3416965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Speech Recognition (ASR) systems have improved and eased how humans interact with devices. ASR system converts an acoustic waveform into the relevant text form. Modern ASR inculcates deep neural networks (DNNs) to provide faster and better results. As the use of DNN continues to expand, there is a need for examination against various adversarial attacks. Adversarial attacks are synthetic samples crafted carefully by adding particular noise to legitimate examples. They are imperceptible, yet they prove catastrophic to DNNs. Recently, adversarial attacks on ASRs have increased but previous surveys lack generalization of the different methods used for attacking ASR, and the scope of the study is narrowed to a particular application, making it difficult to determine the relationships and trade-offs between the attack techniques. Therefore, this survey provides a taxonomy illustrating the classification of the adversarial attacks on ASR based on their characteristics and behavior. Additionally, we have analyzed the existing methods for generating adversarial attacks and presented their comparative analysis. We have clearly drawn the outline to indicate the efficiency of the adversarial techniques, and based on the lacunae found in the existing studies, we have stated the future scope.
引用
收藏
页码:88279 / 88302
页数:24
相关论文
共 50 条
  • [41] Machine Learning in Automatic Speech Recognition: A Survey
    Padmanabhan, Jayashree
    Premkumar, Melvin Jose Johnson
    IETE TECHNICAL REVIEW, 2015, 32 (04) : 240 - 251
  • [42] A Survey of Multilingual Models for Automatic Speech Recognition
    Yadav, Hemant
    Sitaram, Sunayana
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5071 - 5079
  • [43] A detailed survey of Turkish automatic speech recognition
    Arslan, Recep Sinan
    Barisci, Necaattin
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2020, 28 (06) : 3253 - 3269
  • [44] A survey of technologies for automatic Dysarthric speech recognition
    Zhaopeng Qian
    Kejing Xiao
    Chongchong Yu
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [45] Adversarial Black-Box Attacks on Automatic Speech Recognition Systems using Multi-Objective Evolutionary Optimization
    Khare, Shreya
    Aralikatte, Rahul
    Mani, Senthil
    INTERSPEECH 2019, 2019, : 3208 - 3212
  • [46] Paradoxical Role of Adversarial Attacks: Enabling Crosslinguistic Attacks and Information Hiding in Multilingual Speech Recognition
    Zhang, Wenjie
    Xia, Zhihua
    Ma, Bin
    Yan, Diqun
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1046 - 1050
  • [47] Recent improvements of ASR models in the face of adversarial attacks
    Olivier, Raphael
    Raj, Bhiksha
    INTERSPEECH 2022, 2022, : 4113 - 4117
  • [48] Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders
    Kitzing, Peter
    Maier, Andreas
    Ahlander, Viveka Lyberg
    LOGOPEDICS PHONIATRICS VOCOLOGY, 2009, 34 (02) : 91 - 96
  • [49] Towards Visualizing and Detecting Audio Adversarial Examples for Automatic Speech Recognition
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    INFORMATION SECURITY AND PRIVACY, ACISP 2021, 2021, 13083 : 531 - 549
  • [50] JOINT AND ADVERSARIAL TRAINING WITH ASR FOR EXPRESSIVE SPEECH SYNTHESIS
    Zhang, Kaili
    Gong, Cheng
    Lu, Wenhuan
    Wang, Longbiao
    Wei, Jianguo
    Liu, Dawei
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6322 - 6326