Variational mode decomposition based acoustic and entropy features for speech emotion recognition

被引:11
|
作者
Mishra, Siba Prasad [1 ]
Warule, Pankaj [1 ]
Deb, Suman [1 ]
机构
[1] Sardar Vallabhbhai Natl Inst Technol, Surat, Gujarat, India
关键词
Deep neural network; Speech emotion recognition; MFCC; Permutation entropy; Approximate entropy; APPROXIMATE ENTROPY; FEATURE-EXTRACTION; CLASSIFICATION; DEEP;
D O I
10.1016/j.apacoust.2023.109578
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automated speech emotion recognition (SER) is a machine-based method for identifying emotion from speech signals. SER has many practical applications, including improving man-machine interaction (MMI), online customer support, healthcare services, online marketing, etc. Because of the wide range of applications, the popularity of SER has been increasing among researchers for three decades. Numerous studies employed various combinations of features and classifiers to improve emotion classification performance. In our study, we tried to achieve the same by using variational mode decomposition (VMD)-based features. We extracted features like MFCC, mel-spectrogram, approximate entropy (ApEn), and permutation entropy (PrEn) from each VMD mode. The performance of emotion classification is evaluated using the deep neural network (DNN) classifier and the proposed VMD-based features individually (MFCC, mel-spectrogram, ApEn, and PrEn) and in combination (MFCC + mel-spectrogram + ApEn + PrEn). We used two datasets, RAVDESS and EMO-DB, to evaluate the emotion classification performance and obtained a classification accuracy of 91.59% and 80.83% for the EMO-DB and RAVDESS datasets, respectively. Our experimental results were compared with the other methods, and we found that the proposed VMD-based feature combinations with a DNN classifier performed better than the state-of-the-art works in SER.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Speech emotion recognition using a combination of variational mode decomposition and Hilbert transform
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    APPLIED ACOUSTICS, 2024, 222
  • [2] An Extended Variational Mode Decomposition Algorithm Developed Speech Emotion Recognition Performance
    Rudd, David Hason
    Huo, Huan
    Xu, Guandong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 : 219 - 231
  • [3] Electroencephalogram Emotion Recognition Using Combined Features in Variational Mode Decomposition Domain
    Liu, Zhen-Tao
    Hu, Si-Jun
    She, Jinhua
    Yang, Zhaohui
    Xu, Xin
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1595 - 1604
  • [4] Emotion classification from speech signal based on empirical mode decomposition and non-linear features Speech emotion recognition
    Krishnan, Palani Thanaraj
    Alex Noel, Joseph Raj
    Rajangam, Vijayarajan
    COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (04) : 1919 - 1934
  • [5] Novel acoustic features for speech emotion recognition
    ROH Yong-Wan
    KIM Dong-Ju
    LEE Woo-Seok
    HONG Kwang-Seok
    Science China Technological Sciences, 2009, 52 (07) : 1838 - 1848
  • [6] Novel acoustic features for speech emotion recognition
    Yong-Wan Roh
    Dong-Ju Kim
    Woo-Seok Lee
    Kwang-Seok Hong
    Science in China Series E: Technological Sciences, 2009, 52 : 1838 - 1848
  • [7] SPEECH EMOTION RECOGNITION WITH ACOUSTIC AND LEXICAL FEATURES
    Jin, Qin
    Li, Chengxin
    Chen, Shizhe
    Wu, Huimin
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4749 - 4753
  • [8] Novel acoustic features for speech emotion recognition
    Roh Yong-Wan
    Kim Dong-Ju
    Lee Woo-Seok
    Hong Kwang-Seok
    SCIENCE IN CHINA SERIES E-TECHNOLOGICAL SCIENCES, 2009, 52 (07): : 1838 - 1848
  • [9] Fixed frequency range empirical wavelet transform based acoustic and entropy features for speech emotion recognition
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    Speech Communication, 2025, 166
  • [10] An Evolutionary Optimized Variational Mode Decomposition for Emotion Recognition
    Khare, Smith K.
    Bajaj, Varun
    IEEE SENSORS JOURNAL, 2021, 21 (02) : 2035 - 2042