Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system

Cited by: 0
Authors
Zhiyou Yang
Ying Huang
Affiliations
[1] Liuzhou Railway Vocational Technical College, Electronic Information School
[2] Wuhan University
Source
Evolutionary Intelligence | 2022 / Vol. 15
Keywords
Speech emotion recognition; Broad learning system; Human–computer interaction; MFCC; Classification
DOI
Not available
Abstract
Speech plays a major role in conveying emotional information between humans, and speech emotion recognition has become an important part of human–computer interaction systems, especially in systems with demanding real-time and accuracy requirements. To improve the accuracy and real-time performance of speech emotion recognition, much work has been done on emotion feature extraction and recognition algorithms, but the recognition rate still needs improvement. In this paper, we propose a speech emotion recognition method based on Mel-frequency cepstral coefficients (MFCC) and a broad learning network. After the speech signal is preprocessed, 39-dimensional MFCC features are extracted. The data are then labelled and standardized, and a prediction model is built. The data set is split into training and test sets at a fixed ratio (0.8). We experiment with the broad learning network architecture and then improve the data processing within it. The proposed algorithm is a neural network that does not rely on a deep structure; it requires little computation, runs quickly, and has a simple structure. Experimental results show that the proposed architecture achieves higher accuracy and was the most accurate at recognizing emotions on the CASIA Chinese emotion corpus, where the recognition rate can reach 100%. The proposed network architecture therefore provides an effective method for speech emotion recognition.
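The pipeline the abstract outlines (standardize 39-dimensional feature vectors, split the data 0.8, train a flat broad learning network) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. Everything below is an assumption made for illustration: the synthetic Gaussian data standing in for MFCC vectors, the six hypothetical classes, the reading of 0.8 as the training share, the random (not autoencoder-tuned) feature-node weights, the node counts, and the ridge parameter. The standardization statistics are taken from the training split only, which is a design choice of this sketch.

```python
import numpy as np


def standardize(train, test):
    """Zero-mean, unit-variance scaling using training statistics only."""
    mean = train.mean(axis=0)
    std = train.std(axis=0) + 1e-8  # guard against constant features
    return (train - mean) / std, (test - mean) / std


class BroadLearningSketch:
    """Flat network: random feature nodes + enhancement nodes + ridge readout.

    Hypothetical simplification: feature-node weights are purely random.
    """

    def __init__(self, n_feature=40, n_enhance=200, reg=1e-2, seed=0):
        self.n_feature = n_feature
        self.n_enhance = n_enhance
        self.reg = reg
        self.seed = seed

    def _hidden(self, X):
        Z = X @ self.Wf + self.bf            # linear feature nodes
        H = np.tanh(Z @ self.We + self.be)   # nonlinear enhancement nodes
        return np.hstack([Z, H])             # the "broad" concatenated layer

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        d = X.shape[1]
        self.classes_ = np.unique(y)
        self.Wf = rng.standard_normal((d, self.n_feature)) / np.sqrt(d)
        self.bf = rng.standard_normal(self.n_feature)
        self.We = rng.standard_normal(
            (self.n_feature, self.n_enhance)) / np.sqrt(self.n_feature)
        self.be = rng.standard_normal(self.n_enhance)
        A = self._hidden(X)
        Y = (y[:, None] == self.classes_[None, :]).astype(float)  # one-hot
        # Only the output weights are trained, in closed form (ridge):
        # W = (A^T A + reg * I)^-1 A^T Y
        gram = A.T @ A + self.reg * np.eye(A.shape[1])
        self.W = np.linalg.solve(gram, A.T @ Y)
        return self

    def predict(self, X):
        scores = self._hidden(X) @ self.W
        return self.classes_[np.argmax(scores, axis=1)]


def run_demo(seed=0, n_per_class=100, n_classes=6, dim=39):
    """Train on synthetic 39-dimensional vectors; return test accuracy."""
    rng = np.random.default_rng(seed)
    centers = 3.0 * rng.standard_normal((n_classes, dim))
    y = np.repeat(np.arange(n_classes), n_per_class)
    X = centers[y] + rng.standard_normal((y.size, dim))
    order = rng.permutation(y.size)
    n_train = int(0.8 * y.size)              # assumed: 0.8 is the train share
    tr, te = order[:n_train], order[n_train:]
    X_tr, X_te = standardize(X[tr], X[te])
    model = BroadLearningSketch(seed=seed).fit(X_tr, y[tr])
    return float(np.mean(model.predict(X_te) == y[te]))


if __name__ == "__main__":
    print(f"test accuracy on synthetic data: {run_demo():.3f}")
```

Because the hidden weights stay fixed and only the readout is solved in closed form, training costs one linear solve instead of iterative backpropagation, which is consistent with the abstract's claim of a small amount of calculation. Accuracy on this synthetic data says nothing about accuracy on CASIA.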
Pages: 2485-2494
Page count: 9
Related papers
50 in total
  • [31] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    Zhang Jun
    Sam Kwong
    Wei Gang
    Qingyang Hong
    EURASIP Journal on Advances in Signal Processing, 2004
  • [32] Multiple time resolutions for derivatives of mel-frequency cepstral coefficients
    Stemmer, G
    Hacker, C
    Nöth, E
    Niemann, H
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 37 - 40
  • [33] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
    Fekkai, S
    Al-Akaidi, M
    Blackledge, JM
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
  • [34] Using Mel-frequency cepstral coefficients in missing data technique
    Jun, Z
    Kwong, S
    Gang, W
    Hong, QY
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (03) : 340 - 346
  • [35] Low Bit-Rate Speech Coding Through Quantization of Mel-Frequency Cepstral Coefficients
    Boucheron, Laura E.
    De Leon, Phillip L.
    Sandoval, Steven
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 610 - 619
  • [36] Variants of Mel-frequency Cepstral Coefficients for Improved Whispered Speech Speaker Verification in Mismatched Conditions
    Sarria-Paja, Milton
    Falk, Tiago H.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 91 - 95
  • [37] Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition
    Hora, Baveet Singh
    Uthiraa, S.
    Patil, Hemant A.
    SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 116 - 129
  • [38] Classification of Heart Sounds using Linear Prediction Coefficients and Mel-Frequency Cepstral Coefficients as Acoustic Features
    Narvaez, Pedro
    Vera, Katerine
    Bedoya, Nhikolas
    Percybrooks, Winston S.
    2017 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2017,
  • [39] Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech
    Nour-Eldin, Amr H.
    Kabal, Peter
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 53 - 56
  • [40] UNDERSTANDING SARCASM IN SPEECH USING MEL-FREQUENCY CEPSTRAL COEFFICENT
    Mathur, Abhinav
    Saxena, Vikas
    Singh, Sandeep K.
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 728 - 732