Semi-supervised Model for Emotion Recognition in Speech

Cited by: 3
Authors
Pereira, Ingryd [1]
Santos, Diego [2]
Maciel, Alexandre [1]
Barros, Pablo [3]
Affiliations
[1] Univ Pernambuco, Polytech Sch Pernambuco, Recife, PE, Brazil
[2] Federal Univ Pernambuco, Recife, PE, Brazil
[3] Univ Hamburg, Dept Informat, Knowledge Technol, Hamburg, Germany
Keywords
Emotion recognition; Semi-supervised learning; GAN; Speech representation; Deep learning
DOI
10.1007/978-3-030-01418-6_77
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recognizing emotional traits in speech is a challenging task that has become very popular in recent years, especially due to advances in deep neural networks. Although very successful, these models inherit a common problem from strongly supervised deep neural networks: a large number of strongly labeled samples is necessary for the model to learn a general emotion representation. This paper proposes a solution to this problem: a semi-supervised neural network that learns speech representations from unlabeled samples and uses them in different speech emotion recognition scenarios. We provide experiments with different datasets, representing natural and controlled scenarios. Our results show that our model is competitive with state-of-the-art solutions in all these scenarios while sharing the same learned representations, which were learned without the need for strongly labeled data.
Pages: 791-800
Number of pages: 10