Adversarial Attacks on Automatic Speech Recognition (ASR): A Survey

Cited by: 1

Authors:

Bhanushali, Amisha Rajnikant [1]

Mun, Hyunjun [1]

Yun, Joobeom [1]

Affiliations:

[1] Sejong Univ, Dept Comp & Informat Secur & Convergence Engn Inte, Seoul 05006, South Korea

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Surveys; Taxonomy; Automatic speech recognition; Task analysis; Internet; Text recognition; Adversarial machine learning; Artificial neural networks; Adversarial attacks; adversarial samples; automatic speech recognition (ASR); deep neural network (DNN); COEFFICIENTS; EXAMPLES;

DOI:

10.1109/ACCESS.2024.3416965

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Automatic Speech Recognition (ASR) systems have improved and simplified how humans interact with devices. An ASR system converts an acoustic waveform into the corresponding text. Modern ASR systems incorporate deep neural networks (DNNs) to provide faster and better results. As the use of DNNs continues to expand, their robustness against various adversarial attacks needs to be examined. Adversarial attacks use synthetic samples crafted carefully by adding particular noise to legitimate examples. They are imperceptible, yet they prove catastrophic to DNNs. Recently, adversarial attacks on ASR systems have increased, but previous surveys do not generalize across the different methods used for attacking ASR, and their scope is narrowed to a particular application, making it difficult to determine the relationships and trade-offs between the attack techniques. Therefore, this survey provides a taxonomy that classifies adversarial attacks on ASR based on their characteristics and behavior. Additionally, we analyze the existing methods for generating adversarial attacks and present a comparative analysis of them. We outline the efficiency of the adversarial techniques and, based on the gaps found in existing studies, state the future scope.

Pages: 88279-88302

Page count: 24

