Research on Speech Emotional Feature Extraction Based on Multidimensional Feature Fusion

被引:2
|
作者
Zheng, Chunjun [1 ,2 ]
Wang, Chunli [1 ]
Sun, Wei [2 ]
Jia, Ning [2 ]
机构
[1] Dalian Maritime Univ, Dalian, Liaoning, Peoples R China
[2] Dalian Neusoft Univ Informat, Dalian, Liaoning, Peoples R China
关键词
Low-Level Acoustic Descriptors; Convolutional Recurrent Neural Network; Feature Fusion; Speech emotion recognition; RECOGNITION;
D O I
10.1007/978-3-030-35231-8_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of speech processing, speech emotion recognition is a challenging task with broad application prospects. Since the effective speech feature set directly affects the accuracy of speech emotion recognition, the research on effective features is one of the key issues in speech emotion recognition. Emotional expression and individualized features are often related, so it is often difficult to find generalized effective speech features, which is one of the main research contents of this paper. It is necessary to generate a general emotional feature representation in the speech signal from the perspective of local features and global features: (1) Using the spectrogram and Convolutional Recurrent Neural Network (CRNN) to construct the speech emotion recognition model, which can effectively learn to represent the spatial characteristics of the emotional information and to obtain the aggravated local feature information. (2) Using Low-Level acoustic Descriptors (LLD), through a large number of experiments, the feature representations of limited dimensions such as energy, fundamental frequency, spectrum and statistical features based on these low-level features are screened to obtain the global feature description. (3) Combining the previous features, and verifying the performance of various features in emotion recognition on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) emotional corpus, the accuracy and representativeness of the features obtained in this paper are verified.
引用
收藏
页码:535 / 547
页数:13
相关论文
共 50 条
  • [1] Feature extraction of emotional speech based on chaotic characteristics
    Sun, Ying
    Yao, Hui
    Zhang, Xueying
    Zhang, Qiping
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2015, 48 (08): : 681 - 685
  • [2] Research and implementation of emotional feature extraction and recognition in speech signal
    Zhan, Y.-Z. (yzzhan@ujs.edu.cn), 2005, Journal of Jiangsu University (Natural Science Edition) (26):
  • [3] Emotional Speech Recognition Based on Syllable Distribution Feature Extraction
    Zhang, Haiying
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 415 - 420
  • [4] Reduced Feature Extraction for Emotional Speech Recognition
    Palo, Hemanta Kumar
    Mohanty, Mihir Narayan
    2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [5] Emotional Hindi Speech: Feature Extraction and Classification
    Bansal, Sweeta
    Dev, Amita
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1865 - 1868
  • [6] RESEARCH ON THE APPLICATION OF SPEECH DATABASE BASED ON EMOTIONAL FEATURE EXTRACTION IN INTERNATIONAL CHINESE EDUCATION AND TEACHING
    Zhang, Xiangli
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (01): : 299 - 311
  • [7] Research on Visual Speech Feature Extraction
    He Jun
    Zhang Hua
    2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 499 - 502
  • [8] Emotional feature extraction based on phoneme information for speech emotion recognition
    Hyun, Kyang Hak
    Kim, Eun Ho
    Kwak, Yoon Keun
    2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 797 - +
  • [9] The Extraction Method of Emotional Feature Based on Children's Spoken Speech
    Zheng, Chunjun
    Jia, Ning
    Sun, Wei
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 165 - 168
  • [10] A Feature Fusion Method for Feature Extraction
    Tang, Dejun
    Zhang, Weishi
    Qu, Xiaolu
    Wang, Dujuan
    FOURTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2012), 2012, 8334