Recognition of Human Speech Emotion Using Variants of Mel-Frequency Cepstral Coefficients

被引:17
|
作者
Palo, Hemanta Kumar [1 ]
Chandra, Mahesh [2 ]
Mohanty, Mihir Narayan [1 ]
机构
[1] Siksha O Anusandhan Univ, Dept Elect & Commun Engn, Bhubaneswar, Odisha, India
[2] Birla Inst Technol, Dept Elect & Commun Engn, Ranchi, Bihar, India
关键词
Human speech emotion; Mel-frequency cepstral coefficient; Probabilistic neural network; Feature extraction; Wavelet analysis;
D O I
10.1007/978-981-10-4762-6_47
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this chapter, different variants of Mel-frequency cepstral coefficients (MFCCs) describing human speech emotions are investigated. These features are tested and compared for their robustness in terms of classification accuracy and mean square error. Although MFCC is a reliable feature for speech emotion recognition, it does not consider the temporal dynamics between features which is crucial for such analysis. To address this issue, delta MFCC as its first derivative is extracted for comparison. Due to poor performance of MFCC under noisy condition, both MFCC and delta MFCC features are extracted in wavelet domain in the second phase. Time-frequency characterization of emotions using wavelet analysis and energy or amplitude information using MFCC-based features has enhanced the available information. Wavelet-based MFCCs (WMFCCs) and wavelet-based delta MFCCs (WDMFCCs) outperformed standard MFCCs, delta MFCCs, and wavelets in recognition of Berlin speech emotional utterances. Probabilistic neural network (PNN) has been chosen to model the emotions as the classifier is simple to train, much faster, and allows flexible selection of smoothing parameter than other neural network (NN) models. Highest accuracy of 80.79% has been observed with WDMFCCs as compared to 60.97 and 62.76% with MFCCs and wavelets, respectively.
引用
收藏
页码:491 / 498
页数:8
相关论文
共 50 条
  • [1] Emotion Recognition from Speech Signal Using Mel-Frequency Cepstral Coefficients
    Korkmaz, Onur Erdem
    Atasoy, Ayten
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2015, : 1254 - 1257
  • [2] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
    Hashad F.G.
    Halim T.M.
    Diab S.M.
    Sallam B.M.
    El-Samie F.E.A.
    [J]. Pattern Recognition and Image Analysis, 2010, 20 (03) : 360 - 369
  • [3] Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients
    Sheu, Jia-Shing
    Chen, Ching-Wen
    [J]. SENSORS AND MATERIALS, 2020, 32 (10) : 3209 - 3220
  • [4] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Yang, Zhiyou
    Huang, Ying
    [J]. EVOLUTIONARY INTELLIGENCE, 2022, 15 (04) : 2485 - 2494
  • [5] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Zhiyou Yang
    Ying Huang
    [J]. Evolutionary Intelligence, 2022, 15 : 2485 - 2494
  • [6] Mel-Frequency Cepstral Coefficient Analysis in Speech Recognition
    On, Chin Kim
    Pandiyan, Paulraj M.
    Yaacob, Sazali
    Saudi, Azali
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS (ICOCI 2006), 2006, : 291 - +
  • [7] On the Inversion of Mel-Frequency Cepstral Coefficients for Speech Enhancement Applications
    Boucheron, Laura E.
    De Leon, Phillip L.
    [J]. ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 485 - 488
  • [8] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [9] Variants of Mel-frequency Cepstral Coefficients for Improved Whispered Speech Speaker Verification in Mismatched Conditions
    Sarria-Paja, Milton
    Falk, Tiago H.
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 91 - 95
  • [10] Automatic recognition of birdsongs using mel-frequency cepstral coefficients and vector quantization
    Lee, Chang-Hsing
    Lien, Cheng-Chang
    Huang, Ren-Zhuang
    [J]. IMECS 2006: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, 2006, : 331 - +