Emotion classification from speech signal based on empirical mode decomposition and non-linear features: Speech emotion recognition

Cited by: 0

Authors:

Palani Thanaraj Krishnan

Alex Noel Joseph Raj

Vijayarajan Rajangam

Affiliations:

[1] St. Joseph’s College of Engineering, Department of Electronics and Instrumentation Engineering

[2] Shantou University, Department of Electronic Engineering

[3] Vellore Institute of Technology, Division of Healthcare Advancement, Innovation and Research

Source:

Complex & Intelligent Systems | 2021 / Vol. 7

Keywords:

Speech signal; Emotion perception; Entropy measures; Linear discriminant analysis; Empirical mode decomposition

DOI:

10.1007/s40747-021-00295-z

Abstract:

Emotion recognition from speech signals is a widely researched topic in the design of Human–Computer Interface (HCI) models, since it provides insight into the mental state of the user. An HCI often needs to identify a person's emotional condition as cognitive feedback. This paper investigates the recognition of seven emotional states from speech signals: sad, angry, disgust, happy, surprise, pleasant, and neutral. The proposed method detects emotions with entropy features, which are non-linear measures that quantify the randomness of a signal. First, the speech signals are decomposed into Intrinsic Mode Functions (IMFs), and the IMFs are grouped into dominant frequency bands: high frequency, mid-frequency, and base frequency. For the high-frequency band, the entropy measures are computed directly from the IMFs. For the mid- and base-frequency bands, the IMFs are averaged and the entropy measures are computed from the averaged signals. The computed entropy measures form a feature vector that captures the randomness of each emotional signal. The feature vector is then used to train several state-of-the-art classifiers: Linear Discriminant Analysis (LDA), Naïve Bayes, K-Nearest Neighbor, Support Vector Machine, Random Forest, and Gradient Boosting Machine. Tenfold cross-validation on the publicly available Toronto Emotional Speech dataset shows that the LDA classifier achieves a peak balanced accuracy of 93.3%, an F1 score of 87.9%, and an area under the curve of 0.995 in recognizing emotions from the speech of native English speakers.
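The band-wise feature extraction described in the abstract can be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation: the abstract does not say which entropy measures are used or how many IMFs fall into each band, so the histogram-based Shannon entropy and the `n_high` and `n_mid` split sizes are assumptions, as are the function names. The IMFs are assumed to have been computed already by some empirical mode decomposition routine and ordered from highest to lowest frequency; random noise stands in for them here.

```python
import numpy as np


def shannon_entropy(x, bins=32):
    """Histogram-based Shannon entropy (in bits) of a 1-D signal.

    Assumption: the abstract only says "entropy measures"; this
    particular estimator is chosen for illustration.
    """
    counts, _ = np.histogram(x, bins=bins)
    p = counts[counts > 0] / counts.sum()
    return float(-np.sum(p * np.log2(p)))


def band_entropy_features(imfs, n_high=2, n_mid=3):
    """Build an entropy feature vector from IMFs grouped into three bands.

    imfs: array of shape (n_imfs, n_samples), highest frequency first.
    n_high, n_mid: hypothetical band sizes; remaining IMFs form the base band.
    """
    imfs = np.asarray(imfs, dtype=float)
    if imfs.ndim != 2 or imfs.shape[0] <= n_high + n_mid:
        raise ValueError("need a 2-D array with more than n_high + n_mid IMFs")

    # High-frequency band: entropy computed directly from each IMF.
    feats = [shannon_entropy(imf) for imf in imfs[:n_high]]

    # Mid and base bands: average the IMFs first, then compute entropy.
    mid = imfs[n_high:n_high + n_mid].mean(axis=0)
    base = imfs[n_high + n_mid:].mean(axis=0)
    feats.append(shannon_entropy(mid))
    feats.append(shannon_entropy(base))
    return np.array(feats)


# Synthetic stand-in for the IMFs of one utterance (not real speech data).
rng = np.random.default_rng(0)
imfs = rng.standard_normal((8, 2000))
features = band_entropy_features(imfs)
print(features.shape)  # (4,)
```

One such vector per utterance could then be stacked into a feature matrix and passed to a classifier, for example scikit-learn's `LinearDiscriminantAnalysis` evaluated with `cross_val_score(..., cv=10)`, to mirror the tenfold cross-validation mentioned above.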

Pages: 1919-1934

Number of pages: 15
