Speech Emotion Recognition of Merged Features Based on Improved Convolutional Neural Network

被引:0
|
作者
Peng, Wangyue [1 ]
Tang, Xiaoyu [1 ]
机构
[1] South China Normal Univ, Sch Phys & Telecommun Engn, Guangzhou, Peoples R China
关键词
component; speech emotion recognition; log-mel spectrogram; convolutional neural network; affective computing;
D O I
10.1109/icicsp48821.2019.8958556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The correlation between speech and emotions provides the possibility for the realization of psychological forewarning system. Because of the unexpected noise when transferring speech into semantic information, directly obtaining emotion from voice information may perform more efficient and natural in speech emotion recognition. The key of improving the effect of speech emotion recognition is to introduce more reliable features and realized better accuracy. Based on this, 384 dimension hand-crafted features extracted from speech acting as the basic features of speech. At the same time, the multi-dimension learning features obtained from log-mel spectrogram through the improved convolutional neural network combined with long short term memory recurrent neural network will act as the highly features. By applying this method into speech emotion recognition, merged features have been proved to be effective. Compared with the results of only relying on features learning from the log-mel spectrogram, the merged features showed better performance.
引用
收藏
页码:301 / 305
页数:5
相关论文
共 50 条
  • [1] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
    Mohanty, Aniruddha
    Cherukuri, Ravindranath C.
    Prusty, Alok Ranjan
    [J]. THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
  • [2] Speech Emotion Recognition through Hybrid Features and Convolutional Neural Network
    Alluhaidan, Ala Saleh
    Saidani, Oumaima
    Jahangir, Rashid
    Nauman, Muhammad Asif
    Neffati, Omnia Saidani
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [3] Convolutional Neural Network with Spectrogram and Perceptual Features for Speech Emotion Recognition
    Zhang, Linjuan
    Wang, Longbiao
    Dang, Jianwu
    Guo, Lili
    Guan, Haotian
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 62 - 71
  • [4] Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network
    Bhangale, Kishor
    Kothandaraman, Mohanaprasad
    [J]. ELECTRONICS, 2023, 12 (04)
  • [5] Speech emotion recognition based on improved masking EMD and convolutional recurrent neural network
    Sun, Congshan
    Li, Haifeng
    Ma, Lin
    [J]. FRONTIERS IN PSYCHOLOGY, 2023, 13
  • [6] Speech Emotion Recognition based on Interactive Convolutional Neural Network
    Cheng, Huihui
    Tang, Xiaoyu
    [J]. 2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), 2020, : 163 - 167
  • [7] Deep and shallow features fusion based on deep convolutional neural network for speech emotion recognition
    Sun L.
    Chen J.
    Xie K.
    Gu T.
    [J]. International Journal of Speech Technology, 2018, 21 (4) : 931 - 940
  • [8] Parallelized Convolutional Recurrent Neural Network With Spectral Features for Speech Emotion Recognition
    Jiang, Pengxu
    Fu, Hongliang
    Tao, Huawei
    Lei, Peizhi
    Zhao, Li
    [J]. IEEE ACCESS, 2019, 7 : 90368 - 90377
  • [9] Constructing Speech Emotion Recognition Model Based on Convolutional Neural Network
    Kuo, Jong-Yih
    Chen, Zhao-Ming
    Lin, Hui-Chi
    [J]. 2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW 2021), 2021, : 52 - 56
  • [10] Design of a Convolutional Neural Network for Speech Emotion Recognition
    Lee, Kyong Hee
    Kim, Do Hyun
    [J]. 11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1332 - 1335