Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system

被引：0

作者：

Zhiyou Yang

Ying Huang

机构：

[1] Liuzhou Railway Vocational Technical College,Electronic Information School

[2] Wuhan University,undefined

来源：

Evolutionary Intelligence | 2022年 / 15卷

关键词：

Speech emotion recognition; Broad learning system; Human–computer interaction; MFCC; Classification;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Speech plays a major role in emotional transmitting information in humans, and speech emotion recognition has become an important part of the human–computer system, especially in specific systems with high requirements for real-time and accuracy. To improve the accuracy and real-time of speech emotion recognition, people have done a lot of work in speech emotion feature extraction and speech emotion recognition algorithms, but the recognition rate also needs improvement. In this paper, we propose a speech emotion recognition method based on Mel-frequency Cepstral coefficients (MFCC) and broad learning network. 39-dimensional MFCC features were extracted after preprocess of the speech signal. After labelling and standardizing the data, a data prediction model is built. Finally, the data set is split into training and test data onto a certain ratio (0.8). We experimented with broad learning network architecture. And then the data processing in the broad learning network is improved. The proposed algorithm is a neural network structure that does not rely on deep structure, which has a small amount of calculation, excellent calculation speed and simple structure. The experimental results show that the proposed network architecture achieves higher accuracy and it turned out to be the most accurate in recognizing emotions in CASIA Chinese emotion corpus. The recognition rate can reach 100%. Therefore, the proposed network architecture provides an effective method of speech emotion recognition.

引用

页码：2485 / 2494

页数：9

共 50 条

[31] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
Zhang Jun
Sam Kwong
Wei Gang
Qingyang Hong
EURASIP Journal on Advances in Signal Processing, 2004
[32] Multiple time resolutions for derivatives of mel-frequency cepstral coefficients
Stemmer, G
Hacker, C
Nöth, E
Niemann, H
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 37 - 40
[33] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
Fekkai, S
Al-Akaidi, M
Blackledge, JM
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
[34] Using Mel-frequency cepstral coefficients in missing data technique
Jun, Z
Kwong, S
Gang, W
Hong, QY
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (03) : 340 - 346
[35] Low Bit-Rate Speech Coding Through Quantization of Mel-Frequency Cepstral Coefficients
Boucheron, Laura E.
De Leon, Phillip L.
Sandoval, Steven
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 610 - 619
[36] Variants of Mel-frequency Cepstral Coefficients for Improved Whispered Speech Speaker Verification in Mismatched Conditions
Sarria-Paja, Milton
Falk, Tiago H.
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 91 - 95
[37] Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition
Hora, Baveet Singh
Uthiraa, S.
Patil, Hemant A.
SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 116 - 129
[38] Classification of Heart Sounds using Linear Prediction Coefficients and Mel-Frequency Cepstral Coefficients as Acoustic Features
Narvaez, Pedro
Vera, Katerine
Bedoya, Nhikolas
Percybrooks, Winston S.
2017 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2017,
[39] Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech
Nour-Eldin, Amr H.
Kabal, Peter
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 53 - 56
[40] UNDERSTANDING SARCASM IN SPEECH USING MEL-FREQUENCY CEPSTRAL COEFFICENT
Mathur, Abhinav
Saxena, Vikas
Singh, Sandeep K.
PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 728 - 732

← 1 2 3 4 5 →