An efficient low-perceptual environmental sound classification adversarial method based on GAN

被引：0

作者：

Zhang, Qiang ^{[1
]}

Yang, Jibin ^{[2
]}

Zhang, Xiongwei ^{[2
]}

Cao, Tieyong ^{[2
]}

机构：

[1] Army Engn Univ, Grad Sch, Nanjing 210007, Peoples R China

[2] Army Engn Univ, Command & Control Engn Coll, Nanjing 210007, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2024年 / 83卷 / 34期

基金：

中国国家自然科学基金;

关键词：

Environmental sound classification; Deep learning; Generative Adversarial Network; Short-time spectrum; Adversarial example;

D O I：

10.1007/s11042-024-18318-5

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

By incorporating additive perturbations to real samples, adversarial examples have notably exhibited the capability to deceive deep neural networks. Although the existing audio adversarial methods can successfully attack environmental sound classification (ESC) models, these generated perturbations can be easily perceived by humans. And the perturbations cannot be generated efficiently because of the large adversarial perturbation search space in audio. To address the problems, this paper proposes a Short-time Spectrum Generative Adversarial Network-based (StS-GAN) attack method to improve the performance of generated adversaries. In this method, a GAN is implemented to generate the magnitude spectrum perturbations with real signal magnitude spectra as inputs, and adversarial magnitude spectra are obtained as the superposition of the real signal magnitude spectra and the perturbations. Additionally, a short-time processing scheme is adopted to flexibly adjust the input length of the generator to balance computational complexity and attack performance. Through adversarial training, StS-GAN learns to generate adversarial examples with temporal-spectral characteristics similar to those of real signals. The learned perturbations tend to have smaller energies, making them less significant and less distinguishable by human perception. Thorough experiments show that, compared to existing adversarial attack methods, the proposed method achieves a higher Attack Success Rate (ASR) and efficiency, and the generated perturbations are less likely to be perceived by humans. The average ASR reaches 97% while maintaining a mean energy ratio of above 30 dB between the real signal and the generated perturbation, demonstrating the effectiveness of the proposed method.

引用

页码：80847 / 80872

页数：26

共 50 条

[1] Combinatorial Adversarial Defense for Environmental Sound Classification Based on GAN
Zhang, Qiang
Yang, Jibin
Zhang, Xiongwei
Cao, Tieyong
Li, Yihao
[J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (12): : 4399 - 4410
[2] EnvGAN: a GAN-based augmentation to improve environmental sound classification
Aswathy Madhu
Suresh K.
[J]. Artificial Intelligence Review, 2022, 55 : 6301 - 6320
[3] EnvGAN: a GAN-based augmentation to improve environmental sound classification
Madhu, Aswathy
Suresh, K.
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6301 - 6320
[4] Reverse Adversarial Attack To Enhance Environmental Sound Classification
Tripathi, Achyut Mani
Behera, Swarup Ranjan
Paul, Konark
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[5] An efficient code for environmental sound classification
Arora, Raman
Lutfi, Robert A.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (01): : 7 - 10
[6] Large Scale Environmental Sound Classification based on Efficient Feature Extraction
Wang, Xiaoyan
Zhou, Hao
Liu, Zhi
Gu, Yu
[J]. PROCEEDINGS OF 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW 2016), 2016, : 421 - 425
[7] Data Augmentation Using Generative Adversarial Network for Environmental Sound Classification
Madhu, Aswathy
Kumaraswamy, Suresh
[J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[8] Adv-ESC: Adversarial attack datasets for an environmental sound classification
Tripathi, Achyut Mani
Mishra, Aakansha
[J]. APPLIED ACOUSTICS, 2022, 185
[9] A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation
Zeng, Jinfang
Li, Youming
Zhang, Yu
Chen, Da
[J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2021, 20 (03)
[10] Environmental Sound Classification Method Based on Compact Bilinear Attention Network
Dong, Shaojiang
Xia, Zhengfu
Cai, Weiwei
[J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (06): : 102 - 107

← 1 2 3 4 5 →