Emotion classification from speech signal based on empirical mode decomposition and non-linear featuresSpeech emotion recognition

被引:0
|
作者
Palani Thanaraj Krishnan
Alex Noel Joseph Raj
Vijayarajan Rajangam
机构
[1] St. Joseph’s College of Engineering,Department of Electronics and Instrumentation Engineering
[2] Shantou University,Department of Electronic Engineering
[3] Vellore Institute of Technology,Division of Healthcare Advancement, Innovation and Research
来源
关键词
Speech signal; Emotion perception; Entropy measures; Linear discriminant analysis; Empirical mode decomposition;
D O I
暂无
中图分类号
学科分类号
摘要
Emotion recognition system from speech signal is a widely researched topic in the design of the Human–Computer Interface (HCI) models, since it provides insights into the mental states of human beings. Often, it is required to identify the emotional condition of the humans as cognitive feedback in the HCI. In this paper, an attempt to recognize seven emotional states from speech signals, known as sad, angry, disgust, happy, surprise, pleasant, and neutral sentiment, is investigated. The proposed method employs a non-linear signal quantifying method based on randomness measure, known as the entropy feature, for the detection of emotions. Initially, the speech signals are decomposed into Intrinsic Mode Function (IMF), where the IMF signals are divided into dominant frequency bands such as the high frequency, mid-frequency , and base frequency. The entropy measures are computed directly from the high-frequency band in the IMF domain. However, for the mid- and base-band frequencies, the IMFs are averaged and their entropy measures are computed. A feature vector is formed from the computed entropy measures incorporating the randomness feature for all the emotional signals. Then, the feature vector is used to train a few state-of-the-art classifiers, such as Linear Discriminant Analysis (LDA), Naïve Bayes, K-Nearest Neighbor, Support Vector Machine, Random Forest, and Gradient Boosting Machine. A tenfold cross-validation, performed on a publicly available Toronto Emotional Speech dataset, illustrates that the LDA classifier presents a peak balanced accuracy of 93.3%, F1 score of 87.9%, and an area under the curve value of 0.995 in the recognition of emotions from speech signals of native English speakers.
引用
收藏
页码:1919 / 1934
页数:15
相关论文
共 50 条
  • [1] Correction to: Emotion classification from speech signal based on empirical mode decomposition and non-linear featuresSpeech emotion recognition
    Palani Thanaraj Krishnan
    Alex Noel Joseph Raj
    Vijayarajan Rajangam
    [J]. Complex & Intelligent Systems, 2022, 8 : 703 - 703
  • [2] Emotion classification from speech signal based on empirical mode decomposition and non-linear features Speech emotion recognition
    Krishnan, Palani Thanaraj
    Alex Noel, Joseph Raj
    Rajangam, Vijayarajan
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (04) : 1919 - 1934
  • [3] Emotion classification from speech signal based on empirical mode decomposition and non-linear features Speech emotion recognition (Feb, 10.1007/s40747-021-00295-z, 2021)
    Krishnan, Palani Thanaraj
    Joseph Raj, Alex Noel
    Rajangam, Vijayarajan
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (01) : 703 - 703
  • [4] Recognition of Emotion Using Non-Linear Dynamics of Speech
    Harimi, Ali
    Shalizadi, Ali
    Ahmadyfard, Alireza
    [J]. 2014 7th International Symposium on Telecommunications (IST), 2014, : 446 - 451
  • [5] Empirical mode decomposition based weighted frequency feature for speech-based emotion classification
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Epps, Julien
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5017 - 5020
  • [6] Emotion Recognition for Brain Machine Interface: Non-linear Spectral Analysis of EEG Signals Using Empirical Mode Decomposition
    Carella, Tommaso
    De Silvestri, Matteo
    Finedore, Mary
    Haniff, Isaac
    Esmailbeigi, Hananeh
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 223 - 226
  • [7] Ensemble Median Empirical Mode Decomposition for Emotion Recognition Using EEG Signal
    Samal, Priyadarsini
    Hashmi, Mohammad Farukh
    [J]. IEEE SENSORS LETTERS, 2023, 7 (05)
  • [8] Emotion Recognition from Speech Signal
    Ramdinmawii, Esther
    Mohanta, Abhijit
    Mittal, Vinay Kumar
    [J]. TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1562 - 1567
  • [9] Emotion recognition of electroencephalogram signals based on empirical mode decomposition and wavelet
    Zhang, X. D.
    She, Y. C.
    Zhu, L.
    Liu, G. Z.
    Ke, X. Z.
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 123 : 75 - 76
  • [10] Facial emotion recognition using empirical mode decomposition
    Ali, Hasimah
    Hariharan, Muthusamy
    Yaacob, Sazali
    Adom, Abdul Hamid
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (03) : 1261 - 1277