An efficient low-perceptual environmental sound classification adversarial method based on GAN

被引:0
|
作者
Zhang, Qiang [1 ]
Yang, Jibin [2 ]
Zhang, Xiongwei [2 ]
Cao, Tieyong [2 ]
机构
[1] Army Engn Univ, Grad Sch, Nanjing 210007, Peoples R China
[2] Army Engn Univ, Command & Control Engn Coll, Nanjing 210007, Peoples R China
基金
中国国家自然科学基金;
关键词
Environmental sound classification; Deep learning; Generative Adversarial Network; Short-time spectrum; Adversarial example;
D O I
10.1007/s11042-024-18318-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By incorporating additive perturbations to real samples, adversarial examples have notably exhibited the capability to deceive deep neural networks. Although the existing audio adversarial methods can successfully attack environmental sound classification (ESC) models, these generated perturbations can be easily perceived by humans. And the perturbations cannot be generated efficiently because of the large adversarial perturbation search space in audio. To address the problems, this paper proposes a Short-time Spectrum Generative Adversarial Network-based (StS-GAN) attack method to improve the performance of generated adversaries. In this method, a GAN is implemented to generate the magnitude spectrum perturbations with real signal magnitude spectra as inputs, and adversarial magnitude spectra are obtained as the superposition of the real signal magnitude spectra and the perturbations. Additionally, a short-time processing scheme is adopted to flexibly adjust the input length of the generator to balance computational complexity and attack performance. Through adversarial training, StS-GAN learns to generate adversarial examples with temporal-spectral characteristics similar to those of real signals. The learned perturbations tend to have smaller energies, making them less significant and less distinguishable by human perception. Thorough experiments show that, compared to existing adversarial attack methods, the proposed method achieves a higher Attack Success Rate (ASR) and efficiency, and the generated perturbations are less likely to be perceived by humans. The average ASR reaches 97% while maintaining a mean energy ratio of above 30 dB between the real signal and the generated perturbation, demonstrating the effectiveness of the proposed method.
引用
收藏
页码:80847 / 80872
页数:26
相关论文
共 50 条
  • [1] Combinatorial Adversarial Defense for Environmental Sound Classification Based on GAN
    Zhang, Qiang
    Yang, Jibin
    Zhang, Xiongwei
    Cao, Tieyong
    Li, Yihao
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (12): : 4399 - 4410
  • [2] EnvGAN: a GAN-based augmentation to improve environmental sound classification
    Aswathy Madhu
    Suresh K.
    [J]. Artificial Intelligence Review, 2022, 55 : 6301 - 6320
  • [3] EnvGAN: a GAN-based augmentation to improve environmental sound classification
    Madhu, Aswathy
    Suresh, K.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6301 - 6320
  • [4] Reverse Adversarial Attack To Enhance Environmental Sound Classification
    Tripathi, Achyut Mani
    Behera, Swarup Ranjan
    Paul, Konark
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [5] An efficient code for environmental sound classification
    Arora, Raman
    Lutfi, Robert A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (01): : 7 - 10
  • [6] Large Scale Environmental Sound Classification based on Efficient Feature Extraction
    Wang, Xiaoyan
    Zhou, Hao
    Liu, Zhi
    Gu, Yu
    [J]. PROCEEDINGS OF 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW 2016), 2016, : 421 - 425
  • [7] Data Augmentation Using Generative Adversarial Network for Environmental Sound Classification
    Madhu, Aswathy
    Kumaraswamy, Suresh
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [8] Adv-ESC: Adversarial attack datasets for an environmental sound classification
    Tripathi, Achyut Mani
    Mishra, Aakansha
    [J]. APPLIED ACOUSTICS, 2022, 185
  • [9] A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation
    Zeng, Jinfang
    Li, Youming
    Zhang, Yu
    Chen, Da
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2021, 20 (03)
  • [10] Environmental Sound Classification Method Based on Compact Bilinear Attention Network
    Dong, Shaojiang
    Xia, Zhengfu
    Cai, Weiwei
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (06): : 102 - 107