Ensemble deep learning with HuBERT for speech emotion recognition

被引:2
|
作者
Yang, Janghoon [1 ]
机构
[1] Seoul Media Inst Technol, AI Software Engn, Seoul, South Korea
关键词
transformer; ensemble model; HuBERT; speech emotion recognition;
D O I
10.1109/ICSC56153.2023.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a deep learning model for speech emotion recognition which consists of HuBERT, statistical feature extraction, and transformer. Several different ensemble methods are also considered. The proposed method is shown to achieve a test accuracy of 68.31% for CREMA-D. The ensemble method of summing the outputs of multiple models and determining the emotion is shown to achieve an accuracy of 70.24% with the ensemble of 5 models which is close to the state of the art (SOTA) performance with CREMA-D.
引用
收藏
页码:153 / 154
页数:2
相关论文
共 50 条
  • [1] Deep Learning, Ensemble and Supervised Machine Learning for Arabic Speech Emotion Recognition
    Ismaiel, Wahiba
    Alhalangy, Abdalilah
    Mohamed, Adil. O. Y.
    Musa, Abdalla Ibrahim Abdalla
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (02) : 13757 - 13764
  • [2] SPEECH EMOTION RECOGNITION WITH ENSEMBLE LEARNING METHODS
    Shih, Po-Yuan
    Chen, Chia-Ping
    Wu, Chung-Hsien
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2756 - 2760
  • [3] Speech Emotion Recognition with Deep Learning
    Harar, Pavol
    Burget, Radim
    Dutta, Malay Kishore
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
  • [4] Emotion Recognition on Multimodal with Deep Learning and Ensemble
    Dharma, David Adi
    Zahra, Amalia
    [J]. International Journal of Advanced Computer Science and Applications, 2022, 13 (12): : 656 - 663
  • [5] Emotion Recognition on Multimodal with Deep Learning and Ensemble
    Dharma, David Adi
    Zahra, Amalia
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 656 - 663
  • [6] Emotion Recognition in Speech with Deep Learning Architectures
    Erdal, Mehmet
    Kaechele, Markus
    Schwenker, Friedhelm
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, 2016, 9896 : 298 - 311
  • [7] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    [J]. ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [8] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    [J]. ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [9] Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques
    Mihalache, Serban
    Burileanu, Dragos
    [J]. ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2023, 26 (3-4): : 375 - 387
  • [10] Ensemble Learning of Hybrid Acoustic Features for Speech Emotion Recognition
    Zvarevashe, Kudakwashe
    Olugbara, Oludayo
    [J]. ALGORITHMS, 2020, 13 (03)