Deep and shallow features fusion based on deep convolutional neural network for speech emotion recognition

Cited by: 34
Authors
Sun L. [1 ,2 ]
Chen J. [1 ]
Xie K. [1 ]
Gu T. [1 ]
Affiliations
[1] College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing
[2] Key Lab of Broadband Wireless Communication and Sensor Network Technology, Ministry of Education, Nanjing University of Posts and Telecommunications, Nanjing
Funding
National Natural Science Foundation of China;
Keywords
Deep and shallow feature fusion; Deep convolutional neural network; Speech emotion recognition;
DOI
10.1007/s10772-018-9551-4
Abstract
Recent years have witnessed great progress in speech emotion recognition using deep convolutional neural networks (DCNNs). To improve speech emotion recognition performance, a novel feature fusion method is proposed. As the convolutional layers go deeper, the convolutional features of a traditional DCNN gradually become more abstract, which may not be the best representation for speech emotion recognition. On the other hand, shallow features carry only global information and lack the detailed information extracted by deeper convolutional layers. Based on these observations, we design a deep and shallow feature fusion convolutional network, which combines features from different levels of the network for speech emotion recognition. The proposed network allows us to fully exploit both deep and shallow features. The popular Berlin data set is used in our experiments, and the experimental results show that the proposed network further improves the speech emotion recognition rate, demonstrating its effectiveness. © 2018, Springer Science+Business Media, LLC, part of Springer Nature.
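The fusion idea in the abstract — pooling feature maps from an early layer and a late layer, then concatenating them into one vector for the classifier — can be illustrated with a minimal NumPy sketch. This is an assumption-laden toy (1-D input, random kernels, names like `conv1d` and `fused` invented here), not the authors' actual architecture:

```python
import numpy as np

def conv1d(x, kernels):
    """Valid 1-D convolution of each kernel over x, followed by ReLU."""
    k = kernels.shape[1]
    windows = np.stack([x[i:i + k] for i in range(len(x) - k + 1)])  # (L-k+1, k)
    return np.maximum(windows @ kernels.T, 0.0).T                    # (n_kernels, L-k+1)

def global_avg_pool(fmap):
    """Collapse each feature map to a single scalar (its mean)."""
    return fmap.mean(axis=1)

rng = np.random.default_rng(0)
x = rng.standard_normal(64)              # stand-in for one frame of acoustic features
shallow_k = rng.standard_normal((4, 5))  # 4 hypothetical shallow-layer kernels, width 5
deep_k = rng.standard_normal((8, 5))     # 8 hypothetical deep-layer kernels, width 5

fmap_shallow = conv1d(x, shallow_k)                   # early-layer maps: (4, 60)
fmap_deep = conv1d(fmap_shallow.sum(axis=0), deep_k)  # later-layer maps: (8, 56)

shallow_feat = global_avg_pool(fmap_shallow)          # coarse, global view: (4,)
deep_feat = global_avg_pool(fmap_deep)                # more abstract view: (8,)
fused = np.concatenate([shallow_feat, deep_feat])     # (12,) vector for a classifier
```

The key step is the final `np.concatenate`: instead of feeding the classifier only the deepest feature map, the shallow-layer summary rides along, so neither level of abstraction is discarded.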
Pages: 931-940
Page count: 9