Emotion recognition from speech using wavelet packet transform and prosodic features

Cited by: 5
Authors
Gupta, Manish [1]
Bharti, Shambhu Shankar [1]
Agarwal, Suneeta [1]
Affiliation
[1] Natl Inst Technol, Dept Comp Sci & Engn, Allahabad 211004, UP, India
Keywords
Pitch; emotions; speech recognition; SVM; Random Forest (RF)
DOI
10.3233/JIFS-169694
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Emotion distinguishes human beings from machines: machines are emotionless, while human beings are not. If a speaker's emotion is recognized, others can interact with the speaker accordingly. This paper presents a new approach for recognizing all six basic emotions (happiness, anger, fear, sadness, boredom and neutral) from speech signals more effectively. To recognize the emotion of a speaker, the pitch value and two wavelet packet feature vectors derived from the speech signal are used. Principal Component Analysis (PCA) is applied to reduce the dimension of the feature vectors. Random Forest (RF) and Support Vector Machine (SVM) classifiers are trained separately on these reduced feature vectors. The experimental results show an emotion recognition accuracy of 86.11% with the Random Forest classifier and 84.41% with the SVM classifier. The experiments also indicate that 1 s of clean speech is sufficient to recognize the speaker's emotion.
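The feature-extraction step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the wavelet family, the decomposition depth, the pitch estimator, or how the two wavelet packet feature vectors are defined, so everything below is an assumption made for illustration: a Haar filter, a 3-level full packet tree, normalised subband energies as the single wavelet packet feature shown, and an autocorrelation pitch estimate. The function names (`haar_step`, `wavelet_packet_energies`, `estimate_pitch`, `feature_vector`) are hypothetical and do not come from the paper.

```python
import math


def haar_step(x):
    """One Haar analysis step: return (approximation, detail), each half length."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    return approx, detail


def wavelet_packet_energies(x, levels=3):
    """Split every node at every level (full packet tree), then return the
    normalised energy of the 2**levels leaf subbands in natural order."""
    nodes = [list(x)]
    for _ in range(levels):
        next_nodes = []
        for node in nodes:
            approx, detail = haar_step(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    energies = [sum(v * v for v in node) for node in nodes]
    total = sum(energies) or 1.0  # avoid division by zero on silence
    return [e / total for e in energies]


def estimate_pitch(x, sample_rate, fmin=50.0, fmax=400.0):
    """Autocorrelation pitch estimate in Hz, searched between fmin and fmax.
    The search range is an assumed range for speech, not taken from the paper."""
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(x) - 1)
    best_lag, best_val = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        val = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return sample_rate / best_lag


def feature_vector(x, sample_rate, levels=3):
    """Concatenate the pitch value with the wavelet packet subband energies."""
    return [estimate_pitch(x, sample_rate)] + wavelet_packet_energies(x, levels)


# Usage on a synthetic 200 Hz tone (a stand-in for a voiced speech frame).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(1000)]
features = feature_vector(tone, sample_rate)
```

In a full pipeline of the kind the abstract outlines, vectors like `features` would be computed per utterance, reduced with PCA, and passed to separately trained Random Forest and SVM classifiers; those stages are omitted here to keep the sketch free of third-party dependencies.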
Pages: 1541-1553
Page count: 13
Related papers
50 in total
  • [31] Study of speech emotion recognition based on prosodic parameters and facial expression features
    Wang, Yu Tai
    Han, Jie
    Jiang, Xiao Qing
    Zou, Jing
    Zhao, Hui
    [J]. INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 1677 - 1681
  • [32] Automatic Emotion Recognition using Auditory and Prosodic Indicative Features
    Gharsellaoui, Soumaya
    Selouani, Sid-Ahmed
    Dahmane, Adel Omar
    [J]. 2015 IEEE 28TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2015, : 1265 - 1270
  • [33] Acoustic-Prosodic Recognition of Emotion in Speech
    Montenegro, Chuchi S.
    Maravillas, Elmer A.
    [J]. 2015 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2015, : 527 - +
  • [34] Speech Emotion Recognition Based on LSTM and Mel Scale Wavelet Packet Decomposition
    Feng, Tian
    Yang, Shuying
    [J]. 2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [35] A new speech enhancement algorithm using wavelet packet transform
    Guo, Jichang
    Wang, Wenliang
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 504 - 506
  • [36] Dialect recognition from Telugu speech utterances using spectral and prosodic features
    Shivaprasad, S.
    Sadanandam, M.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 27 (2) : 515 - 515
  • [37] Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech
    D. J. Mahadevaswamy
    [J]. Wireless Personal Communications, 2021, 121 : 1781 - 1804
  • [38] Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech
    Mahadevaswamy
    Ravi, D. J.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2021, 121 (03) : 1781 - 1804
  • [39] Speech Emotion Recognition using Combination of Features
    Zhang, Qingli
    An, Ning
    Wang, Kunxia
    Ren, Fuji
    Li, Lian
    [J]. PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 523 - 528
  • [40] Speech Recognition Based on Wavelet Packet Transform and K-L Expansion
    Wang, Xu
    Han, Zhiyan
    Wang, Han
    Ma, Yujuan
    [J]. 2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2490 - 2493