Research on Speech Emotional Feature Extraction Based on Multidimensional Feature Fusion

被引：2

作者：

Zheng, Chunjun ^{[1
,2
]}

Wang, Chunli ^{[1
]}

Sun, Wei ^{[2
]}

Jia, Ning ^{[2
]}

机构：

[1] Dalian Maritime Univ, Dalian, Liaoning, Peoples R China

[2] Dalian Neusoft Univ Informat, Dalian, Liaoning, Peoples R China

来源：

ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019 | 2019年 / 11888卷

关键词：

Low-Level Acoustic Descriptors; Convolutional Recurrent Neural Network; Feature Fusion; Speech emotion recognition; RECOGNITION;

D O I：

10.1007/978-3-030-35231-8_39

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the field of speech processing, speech emotion recognition is a challenging task with broad application prospects. Since the effective speech feature set directly affects the accuracy of speech emotion recognition, the research on effective features is one of the key issues in speech emotion recognition. Emotional expression and individualized features are often related, so it is often difficult to find generalized effective speech features, which is one of the main research contents of this paper. It is necessary to generate a general emotional feature representation in the speech signal from the perspective of local features and global features: (1) Using the spectrogram and Convolutional Recurrent Neural Network (CRNN) to construct the speech emotion recognition model, which can effectively learn to represent the spatial characteristics of the emotional information and to obtain the aggravated local feature information. (2) Using Low-Level acoustic Descriptors (LLD), through a large number of experiments, the feature representations of limited dimensions such as energy, fundamental frequency, spectrum and statistical features based on these low-level features are screened to obtain the global feature description. (3) Combining the previous features, and verifying the performance of various features in emotion recognition on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) emotional corpus, the accuracy and representativeness of the features obtained in this paper are verified.

引用

页码：535 / 547

页数：13

共 50 条

[1] Feature extraction of emotional speech based on chaotic characteristics
Sun, Ying
Yao, Hui
Zhang, Xueying
Zhang, Qiping
Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2015, 48 (08): : 681 - 685
[2] Research and implementation of emotional feature extraction and recognition in speech signal
Zhan, Y.-Z. (yzzhan@ujs.edu.cn), 2005, Journal of Jiangsu University (Natural Science Edition) (26):
[3] Emotional Speech Recognition Based on Syllable Distribution Feature Extraction
Zhang, Haiying
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 415 - 420
[4] Reduced Feature Extraction for Emotional Speech Recognition
Palo, Hemanta Kumar
Mohanty, Mihir Narayan
2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
[5] Emotional Hindi Speech: Feature Extraction and Classification
Bansal, Sweeta
Dev, Amita
2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1865 - 1868
[6] RESEARCH ON THE APPLICATION OF SPEECH DATABASE BASED ON EMOTIONAL FEATURE EXTRACTION IN INTERNATIONAL CHINESE EDUCATION AND TEACHING
Zhang, Xiangli
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (01): : 299 - 311
[7] Research on Visual Speech Feature Extraction
He Jun
Zhang Hua
2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 499 - 502
[8] Emotional feature extraction based on phoneme information for speech emotion recognition
Hyun, Kyang Hak
Kim, Eun Ho
Kwak, Yoon Keun
2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 797 - +
[9] The Extraction Method of Emotional Feature Based on Children's Spoken Speech
Zheng, Chunjun
Jia, Ning
Sun, Wei
2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 165 - 168
[10] A Feature Fusion Method for Feature Extraction
Tang, Dejun
Zhang, Weishi
Qu, Xiaolu
Wang, Dujuan
FOURTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2012), 2012, 8334

← 1 2 3 4 5 →