Emotion recognition from speech using wavelet packet transform and prosodic features

被引:5
|
作者
Gupta, Manish [1 ]
Bharti, Shambhu Shankar [1 ]
Agarwal, Suneeta [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Allahabad 211004, UP, India
关键词
Pitch; emotions; speech recognition; SVM; Random Forest (RF);
D O I
10.3233/JIFS-169694
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion is a property by which human beings and machines can be differentiated as machines are emotionless while human beings are not. If the emotion of a speaker is recognized then others can interact accordingly. This paper presents a new approach for recognizing all the six basic emotions (Happy, anger, fear, sadness, boredom and neutral) from the speech signals more effectively. To recognize the emotion of a speaker, pitch value and two wavelet packet feature vectors derived from speech signals are used. Principal Component Analysis (PCA) has been applied to reduce the dimension of feature vectors. Random Forest (RF) and Support Vector Machine (SVM) classifiers are trained separately based on these reduced feature vectors. The experimental results show that the accuracy of emotion recognition with Random Forest classifier is 86.11% while with SVM classifier it is 84.41%. Experimentally, it is also found that clean speech of 1 sec duration is sufficient enough to recognize emotion of the speaker.
引用
收藏
页码:1541 / 1553
页数:13
相关论文
共 50 条
  • [1] Egyptian Arabic speech emotion recognition using prosodic, spectral and wavelet features
    Abdel-Hamid, Lamiaa
    [J]. SPEECH COMMUNICATION, 2020, 122 : 19 - 30
  • [2] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [3] Emotion recognition from speech using global and local prosodic features
    Rao K.S.
    Koolagudi S.G.
    Vempada R.R.
    [J]. International Journal of Speech Technology, 2013, 16 (2) : 143 - 160
  • [4] Emotion recognition from speech using source, system, and prosodic features
    Koolagudi, Shashidhar G.
    Rao, K. Sreenivasa
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 265 - 289
  • [5] Emotion recognition from speech using source, system, and prosodic features
    Shashidhar G. Koolagudi
    K. Sreenivasa Rao
    [J]. International Journal of Speech Technology, 2012, 15 (2) : 265 - 289
  • [6] Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
    Sen, Tjong Wan
    Trilaksono, Bambang Riyanto
    Arman, Arry Akhmad
    Mandala, Rila
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2009, 3 (02) : 123 - 134
  • [7] Performance Analysis of Emotion Recognition from Speech Using Combined Prosodic Features
    Palo, Hemanta K.
    Mohanty, Mihir N.
    [J]. ADVANCED SCIENCE LETTERS, 2016, 22 (02) : 288 - 293
  • [8] Emotion Recognition From Speech Using Wavelet Packet Transform Cochlear Filter Bank and Random Forest Classifier
    Hamsa, Shibani
    Shahin, Ismail
    Iraqi, Youssef
    Werghi, Naoufel
    [J]. IEEE ACCESS, 2020, 8 : 96994 - 97006
  • [9] Emotion Recognition in Speech Using MFCC and Wavelet Features
    Kishore, K. V. Krishna
    Satish, P. Krishna
    [J]. PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 842 - 847
  • [10] Emotion modeling from speech signal based on wavelet packet transform
    Degaonkar, Varsha
    Apte, Shaila
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2013, 16 (01) : 1 - 5