Emotion recognition from speech using wavelet packet transform and prosodic features

被引:5
|
作者
Gupta, Manish [1 ]
Bharti, Shambhu Shankar [1 ]
Agarwal, Suneeta [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Allahabad 211004, UP, India
关键词
Pitch; emotions; speech recognition; SVM; Random Forest (RF);
D O I
10.3233/JIFS-169694
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion is a property by which human beings and machines can be differentiated as machines are emotionless while human beings are not. If the emotion of a speaker is recognized then others can interact accordingly. This paper presents a new approach for recognizing all the six basic emotions (Happy, anger, fear, sadness, boredom and neutral) from the speech signals more effectively. To recognize the emotion of a speaker, pitch value and two wavelet packet feature vectors derived from speech signals are used. Principal Component Analysis (PCA) has been applied to reduce the dimension of feature vectors. Random Forest (RF) and Support Vector Machine (SVM) classifiers are trained separately based on these reduced feature vectors. The experimental results show that the accuracy of emotion recognition with Random Forest classifier is 86.11% while with SVM classifier it is 84.41%. Experimentally, it is also found that clean speech of 1 sec duration is sufficient enough to recognize emotion of the speaker.
引用
下载
收藏
页码:1541 / 1553
页数:13
相关论文
共 50 条
  • [21] A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
    Zhou, Yu
    Li, Junfeng
    Sun, Yanqing
    Zhang, Jianping
    Yan, Yonghong
    Akagi, Masato
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (10) : 2813 - 2821
  • [22] PERFORMANCE ANALYSIS OF SPECTRAL AND PROSODIC FEATURES AND THEIR FUSION FOR EMOTION RECOGNITION IN SPEECH
    Gaurav, Manish
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 313 - 316
  • [23] Experimental Study in Emotion Recognition using Prosodic Features
    Pavaloi, Ioan
    Musca, Elena
    2015 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2015,
  • [24] Speech Emotion Recognition Based on Wavelet Transform and Improved HMM
    Han Zhiyan
    Wang Jian
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3156 - 3159
  • [25] Speech recognition using a proposed indexing tree algorithm based on wavelet packet transform
    Mabmoud, W.A.
    Juda, N.R.
    Gagad, N.H.
    Advances in Modelling and Analysis B, 2003, 46 (3-4): : 25 - 36
  • [26] A Novel Emotion Recognizer from Speech Using Both Prosodic and Linguistic Features
    Suzuki, Motoyuki
    Tsuchiya, Seiji
    Ren, Fuji
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT I: 15TH INTERNATIONAL CONFERENCE, KES 2011, 2011, 6881 : 456 - 465
  • [27] Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients
    Huang, Yongming
    Wu, Ao
    Zhang, Guobao
    Li, Yue
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 436 - 443
  • [28] Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet Domain Features
    Noor, Shamman
    Dhrubo, Ehsan Ahmed
    Minhaz, Ahmed Tahseen
    Shahnaz, Celia
    Fattah, Shaikh Anowarul
    2017 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (IEEE WIECON-ECE 2017), 2017, : 233 - 236
  • [29] Prosodic feature normalization for emotion recognition by using synthesized speech
    Suzuki, Motoyuki
    Nakagawa, Shohei
    Kita, Kenji
    ADVANCES IN KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, 2012, 243 : 306 - 313
  • [30] Study of speech emotion recognition based on prosodic parameters and facial expression features
    Wang, Yu Tai
    Han, Jie
    Jiang, Xiao Qing
    Zou, Jing
    Zhao, Hui
    INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 1677 - 1681