GENERATING AND PROTECTING AGAINST ADVERSARIAL ATTACKS FOR DEEP SPEECH-BASED EMOTION RECOGNITION MODELS

Cited by: 0
Authors
Ren, Zhao [1]
Baird, Alice [1]
Han, Jing [1]
Zhang, Zixing [2]
Schuller, Bjoern [1,2]
Affiliations
[1] Univ Augsburg, ZD B Chair Embedded Intelligence Hlth Care & Well, Augsburg, Germany
[2] Imperial Coll London, GLAM Grp Language Audio & Mus, London, England
Funding
European Union Horizon 2020;
Keywords
Speech Emotion Recognition; Adversarial Attacks; Adversarial Training; Convolutional Neural Network;
DOI
10.1109/icassp40776.2020.9054087
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The development of deep learning models for speech emotion recognition has become a popular area of research. Adversarially generated data can cause false predictions, so defense methods against such attacks are needed to ensure model robustness. With this in mind, in this study, we aim to train deep models to defend against non-targeted white-box adversarial attacks. Adversarial data is first generated from the real data using the fast gradient sign method. Adversarial training is then employed, in the research field of speech emotion recognition, as a method for protecting against adversarial attacks. We train deep convolutional models with both real and adversarial data, and compare the performance of two adversarial training procedures, namely vanilla adversarial training and similarity-based adversarial training. In our experiments, through the use of adversarial data augmentation, both of the considered adversarial training procedures can improve the performance when validated on the real data. Additionally, the similarity-based adversarial training learns a more robust model when working with adversarial data. Finally, the considered VGG-16 model performs the best across all models, for both real and generated data.
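The fast gradient sign method named in the abstract perturbs an input by a small step in the direction of the sign of the loss gradient with respect to that input. The following is a minimal sketch of that idea only, not the authors' implementation: it assumes a toy logistic-regression classifier in place of the paper's convolutional models, and the weights `w`, bias `b`, feature vector `x`, and step size `epsilon` are hypothetical values chosen for illustration.

```python
import math


def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def predict(w, b, x):
    """Probability of the positive class under a toy logistic model (assumption)."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))


def loss(w, b, x, y):
    """Binary cross-entropy loss for a single example with label y in {0, 1}."""
    p = predict(w, b, x)
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))


def input_gradient(w, b, x, y):
    """Gradient of the loss with respect to the input x.

    For logistic regression with cross-entropy, d(loss)/d(x_i) = (p - y) * w_i.
    """
    p = predict(w, b, x)
    return [(p - y) * wi for wi in w]


def fgsm(w, b, x, y, epsilon):
    """Non-targeted FGSM: x_adv = x + epsilon * sign(grad_x loss)."""
    grad = input_gradient(w, b, x, y)
    return [xi + epsilon * sign(gi) for xi, gi in zip(x, grad)]


# Hypothetical model parameters and input, for illustration only.
w = [2.0, -1.0, 0.5]
b = 0.1
x = [0.4, -0.3, 0.8]
y = 1
epsilon = 0.1

x_adv = fgsm(w, b, x, y, epsilon)
print("loss on real input:       ", loss(w, b, x, y))
print("loss on adversarial input:", loss(w, b, x_adv, y))
```

Because the attack uses the model's own gradient, it is a white-box attack, and because it only increases the loss for the true label it is non-targeted. Under the same assumptions, adversarial training in the sense the abstract describes would add each `x_adv`, paired with its original label `y`, to the training data alongside the real examples.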
Pages: 7184-7188
Number of pages: 5