COUPLED UNSUPERVISED DEEP CONVOLUTIONAL DOMAIN ADAPTATION FOR SPEECH EMOTION RECOGNITION

被引:0
|
作者
Noi, Ocquaye Elias Nii [1 ]
Mao, Qirong [1 ]
Xu, Guopeng [1 ]
Xue, Yanfei [1 ]
机构
[1] Jiangsu Univ, Comp Sci, Zhenjiang, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Speech emotion recognition; unsupervised domain adaptation; convolutional neural network; correlation alignment;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The assumption of identical feature distributions in both training and testing datasets in Speech Emotion Recognition is not applicable in many scenarios, especially when different datasets are involved in the training and testing phases. Domain Adaptation methods have been proposed over the years to curtail this issue. In this paper, we propose an Unsupervised Domain Adaptation method which is a Coupled Deep Convolutional Neural Network (CDCNN) architecture. The architecture uses the correlation alignment loss (CALoss) of both the source and target distributions without target labels to effectively minimize the domain shift and learn extremely good nonlinear transformations. Also, the weights in the corresponding layers in both streams are not shared yet related which is effective for modeling the shift of one domain to the other. To evaluate our proposed method, we use the INTERSPEECH 2009 Emotion Challenge's FAU Aibo Emotion Corpus as target dataset and two publicly available corpora (ABC and Emo-DB) as source dataset. Experimental results indicate that our proposed method is superior to other state-of-the-arts-methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Dual Exclusive Attentive Transfer for Unsupervised Deep Convolutional Domain Adaptation in Speech Emotion Recognition
    Ocquaye, Elias Nii Noi
    Mao, Qirong
    Song, Heping
    Xu, Guopeng
    Xue, Yanfei
    [J]. IEEE ACCESS, 2019, 7 : 93847 - 93857
  • [2] Unsupervised domain adaptation for speech emotion recognition using PCANet
    Huang, Zhengwei
    Xue, Wentao
    Mao, Qirong
    Zhan, Yongzhao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (05) : 6785 - 6799
  • [3] Unsupervised domain adaptation for speech emotion recognition using PCANet
    Zhengwei Huang
    Wentao Xue
    Qirong Mao
    Yongzhao Zhan
    [J]. Multimedia Tools and Applications, 2017, 76 : 6785 - 6799
  • [4] Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition
    Deng, Jun
    Zhang, Zixing
    Eyben, Florian
    Schuller, Bjoern
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1068 - 1072
  • [5] An unsupervised deep domain adaptation approach for robust speech recognition
    Sun, Sining
    Zhang, Binbin
    Xie, Lei
    Zhang, Yanning
    [J]. NEUROCOMPUTING, 2017, 257 : 79 - 87
  • [6] Deep Unsupervised Convolutional Domain Adaptation
    Zhuo, Junbao
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 261 - 269
  • [7] Unsupervised Adversarial Domain Adaptation for Cross-Lingual Speech Emotion Recognition
    Latif, Siddique
    Qadir, Junaid
    Bilal, Muhammad
    [J]. 2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2019,
  • [8] Unsupervised domain adaptation for speech recognition with unsupervised error correction
    Mai, Long
    Carson-Berndsen, Julie
    [J]. INTERSPEECH 2022, 2022, : 5120 - 5124
  • [9] Deep Unsupervised Domain Adaptation for Face Recognition
    Luo, Zimeng
    Hu, Jiani
    Deng, Weihong
    Shen, Haifeng
    [J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 453 - 457
  • [10] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59 (59)