Emotion recognition from speech using wavelet packet transform and prosodic features

被引：5

作者：

Gupta, Manish ^{[1
]}

Bharti, Shambhu Shankar ^{[1
]}

Agarwal, Suneeta ^{[1
]}

机构：

[1] Natl Inst Technol, Dept Comp Sci & Engn, Allahabad 211004, UP, India

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2018年 / 35卷 / 02期

关键词：

Pitch; emotions; speech recognition; SVM; Random Forest (RF);

D O I：

10.3233/JIFS-169694

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Emotion is a property by which human beings and machines can be differentiated as machines are emotionless while human beings are not. If the emotion of a speaker is recognized then others can interact accordingly. This paper presents a new approach for recognizing all the six basic emotions (Happy, anger, fear, sadness, boredom and neutral) from the speech signals more effectively. To recognize the emotion of a speaker, pitch value and two wavelet packet feature vectors derived from speech signals are used. Principal Component Analysis (PCA) has been applied to reduce the dimension of feature vectors. Random Forest (RF) and Support Vector Machine (SVM) classifiers are trained separately based on these reduced feature vectors. The experimental results show that the accuracy of emotion recognition with Random Forest classifier is 86.11% while with SVM classifier it is 84.41%. Experimentally, it is also found that clean speech of 1 sec duration is sufficient enough to recognize emotion of the speaker.

引用

下载

页码：1541 / 1553

页数：13

共 50 条

[41] Speech Recognition Based on Wavelet Packet Transform and K-L Expansion
Wang, Xu
Han, Zhiyan
Wang, Han
Ma, Yujuan
2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2490 - 2493
[42] Speech emotion recognition using Ramanujan Fourier Transform
Flower, T. Mary Little
Jaya, T.
APPLIED ACOUSTICS, 2022, 201
[43] Improved Emotion Recognition with Novel Task-Oriented Wavelet Packet Features
Huang, Yongming
Zhang, Guobao
Li, Yue
Wu, Ao
INTELLIGENT COMPUTING THEORY, 2014, 8588 : 706 - 714
[44] Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features
Starlet Ben Alex
Leena Mary
Ben P. Babu
Circuits, Systems, and Signal Processing, 2020, 39 : 5681 - 5709
[45] Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features
Ben Alex, Starlet
Mary, Leena
Babu, Ben P.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (11) : 5681 - 5709
[46] Non-linear Dynamics Characterization from Wavelet Packet Transform for Automatic Recognition of Emotional Speech
Vasquez-Correa, J. C.
Orozco-Arroyave, J. R.
Arias-Londono, J. D.
Vargas-Bonilla, J. F.
Noth, Elmar
RECENT ADVANCES IN NONLINEAR SPEECH PROCESSING, 2016, 48 : 199 - 207
[47] Speech emotion recognition based on deep belief networks and wavelet packet cepstral coefficients
Huang Y.
Wu A.
Zhang G.
Li Y.
1600, UK Simulation Society, Clifton Lane, Nottingham, NG11 8NS, United Kingdom (17): : 28.1 - 28.5
[48] Emotion recognition from telephone speech using acoustic and nonlinear features
Bedoya-Jaramillo, S.
Orozco-Arroyave, J. R.
Arias-Londono, J. D.
Vargas-Bonilla, J. F.
2013 47TH INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2013,
[49] Emotion recognition from speech signals using new harmony features
Yang, B.
Lugger, M.
SIGNAL PROCESSING, 2010, 90 (05) : 1415 - 1423
[50] Speech emotion recognition using multi resolution Hilbert transform based spectral and entropy features
Mishra, Siba Prasad
Warule, Pankaj
Deb, Suman
Applied Acoustics, 2025, 229

← 1 2 3 4 5 →