Adversarial Attacks on Automatic Speech Recognition (ASR): A Survey

被引：1

作者：

Bhanushali, Amisha Rajnikant ^{[1
]}

Mun, Hyunjun ^{[1
]}

Yun, Joobeom ^{[1
]}

机构：

[1] Sejong Univ, Dept Comp & Informat Secur & Convergence Engn Inte, Seoul 05006, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Surveys; Taxonomy; Automatic speech recognition; Task analysis; Internet; Text recognition; Adversarial machine learning; Artificial neural networks; Adversarial attacks; adversarial samples; automatic speech recognition (ASR); deep neural network (DNN); COEFFICIENTS; EXAMPLES;

D O I：

10.1109/ACCESS.2024.3416965

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Automatic Speech Recognition (ASR) systems have improved and eased how humans interact with devices. ASR system converts an acoustic waveform into the relevant text form. Modern ASR inculcates deep neural networks (DNNs) to provide faster and better results. As the use of DNN continues to expand, there is a need for examination against various adversarial attacks. Adversarial attacks are synthetic samples crafted carefully by adding particular noise to legitimate examples. They are imperceptible, yet they prove catastrophic to DNNs. Recently, adversarial attacks on ASRs have increased but previous surveys lack generalization of the different methods used for attacking ASR, and the scope of the study is narrowed to a particular application, making it difficult to determine the relationships and trade-offs between the attack techniques. Therefore, this survey provides a taxonomy illustrating the classification of the adversarial attacks on ASR based on their characteristics and behavior. Additionally, we have analyzed the existing methods for generating adversarial attacks and presented their comparative analysis. We have clearly drawn the outline to indicate the efficiency of the adversarial techniques, and based on the lacunae found in the existing studies, we have stated the future scope.

引用

页码：88279 / 88302

页数：24

共 50 条

[31] Black-box adversarial attacks through speech distortion for speech emotion recognition
Jinxing Gao
Diqun Yan
Mingyu Dong
EURASIP Journal on Audio, Speech, and Music Processing, 2022
[32] Adversarial attacks and defenses in deep learning for image recognition: A survey
Wang, Jia
Wang, Chengyu
Lin, Qiuzhen
Luo, Chengwen
Wu, Chao
Li, Jianqiang
NEUROCOMPUTING, 2022, 514 : 162 - 181
[33] CommanderUAP: a practical and transferable universal adversarial attacks on speech recognition models
Sun, Zheng
Zhao, Jinxiao
Guo, Feng
Chen, Yuxuan
Ju, Lei
CYBERSECURITY, 2024, 7 (01):
[34] A Systematic Evaluation of Adversarial Attacks against Speech Emotion Recognition Models
Facchinetti, Nicolas
Simonetta, Federico
Ntalampiras, Stavros
Intelligent Computing, 2024, 3
[35] Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition
Zhang, Yike
Zhang, Pengyuan
Yan, Yonghong
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3348 - 3352
[36] Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations
Dhawan, Kunal
Koluguri, Nithin Rao
Jukic, Ante
Langman, Ryan
Balam, Jagadeesh
Ginsburg, Boris
INTERSPEECH 2024, 2024, : 2574 - 2578
[37] Classification Techniques for Automatic Speech Recognition (ASR) Algorithms used with Real Time Speech Translation
Nasereddin, Hebah H. O.
Omari, Ayoub Abdel Rahman
2017 COMPUTING CONFERENCE, 2017, : 200 - 207
[38] Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition
Qin, Yao
Carlini, Nicholas
Goodfellow, Ian
Cottrell, Garrison
Raffel, Colin
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[39] Effective Adversarial Sample Detection for Securing Automatic Speech Recognition
Lin, Chih-Yang
Wang, Yan-Zhang
Lin, Shou-Kuan
Farady, Isack
Jan, Yih-Kuen
Lin, Wei-Yang
2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, AVSS 2024, 2024,
[40] A survey of technologies for automatic Dysarthric speech recognition
Qian, Zhaopeng
Xiao, Kejing
Yu, Chongchong
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)

← 1 2 3 4 5 →