Recognition of emotion from speech using evolutionary cepstral coefficients

被引:5
|
作者
Bakhshi, Ali [1 ]
Chalup, Stephan [1 ]
Harimi, Ali [2 ]
Mirhassani, Seyed Mostafa [3 ]
机构
[1] Univ Newcastle, Sch Elect Engn & Comp, Newcastle, NSW, Australia
[2] Islamic Azad Univ, Dept Elect Engn, Shahrood Branch, Shahrood, Iran
[3] Univ Malaya, Dept Biomed Engn, Kuala Lumpur, Malaysia
关键词
Genetic algorithm; Mel filterbank; Cepstral coefficients; Speech emotion recognition; SPECTRAL FEATURES; FEATURE-EXTRACTION; NEURAL-NETWORK; CLASSIFICATION; ALGORITHM; FUSION; MFCC;
D O I
10.1007/s11042-020-09591-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An optimal representation of acoustic features is an ongoing challenge in automatic speech emotion recognition research. In this study, we proposed Cepstral coefficients based on evolutionary filterbanks as emotional features. It is difficult to guarantee that an individual optimized filterbank provides the best representation for emotion classification. Consequently, we employed six HMM-based binary classifiers that used a specific filterbank, which was optimized by a genetic algorithm to categorize the data into seven emotion classes. These optimized classifiers were applied in a hierarchical manner and outperformed conventional Mel Frequency Cepstral Coefficients in terms of overall emotion classification accuracy. The proposed method using evolutionary-based Cepstral coefficients achieved a weighted average recall of 87.29% on the Berlin database while the same approach but using conventional Cepstral features achieved only 79.63%.
引用
收藏
页码:35739 / 35759
页数:21
相关论文
共 50 条
  • [1] Recognition of emotion from speech using evolutionary cepstral coefficients
    Ali Bakhshi
    Stephan Chalup
    Ali Harimi
    Seyed Mostafa Mirhassani
    [J]. Multimedia Tools and Applications, 2020, 79 : 35739 - 35759
  • [2] Emotion Recognition from Speech Signal Using Mel-Frequency Cepstral Coefficients
    Korkmaz, Onur Erdem
    Atasoy, Ayten
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2015, : 1254 - 1257
  • [3] Speech Emotion Recognition Using Gammatone Cepstral Coefficients and Deep Learning Features
    Sharan, Roneel, V
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES, ICMLANT, 2023, : 139 - 142
  • [4] Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition
    Hora, Baveet Singh
    Uthiraa, S.
    Patil, Hemant A.
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 116 - 129
  • [5] Recognition of Human Speech Emotion Using Variants of Mel-Frequency Cepstral Coefficients
    Palo, Hemanta Kumar
    Chandra, Mahesh
    Mohanty, Mihir Narayan
    [J]. ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 491 - 498
  • [6] Using the Lyapunov Exponent from Cepstral Coefficients for Automatic Emotion Recognition
    Zbancioc, Marius Dan
    Feraru, Monica
    [J]. 2014 INTERNATIONAL CONFERENCE AND EXPOSITION ON ELECTRICAL AND POWER ENGINEERING (EPE), 2014, : 110 - 113
  • [7] Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients
    Huang, Yongming
    Wu, Ao
    Zhang, Guobao
    Li, Yue
    [J]. PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 436 - 443
  • [8] Acoustic Emotion Recognition Using Linear and Nonlinear Cepstral Coefficients
    Chenchah, Farah
    Lachiri, Zied
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (11) : 135 - 138
  • [9] Speech Emotion Recognition Using Auditory Spectrogram and Cepstral Features
    Zhao, Shujie
    Yang, Yan
    Cohen, Israel
    Zhang, Lijun
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 136 - 140
  • [10] Fusion of mel and gammatone frequency cepstral coefficients for speech emotion recognition using deep C-RNN
    Kumaran, U.
    Radha Rammohan, S.
    Nagarajan, Senthil Murugan
    Prathik, A.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 303 - 314