Improving the performance of the speaker emotion recognition based on low dimension prosody features vector

被引:6
|
作者
Gudmalwar, Ashishkumar Prabhakar [1 ]
Rao, Ch V. Rama [1 ]
Dutta, Anirban [1 ]
机构
[1] Natl Inst Technol, Shillong, Meghalaya, India
关键词
Prosody; PCA; Emotion recognition; Recognition rate; SPEECH;
D O I
10.1007/s10772-018-09576-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker emotion recognition is an important research issue as it finds lots of applications in human-robot interaction, computer-human interaction, etc. This work deals with the recognition of emotion of the speaker from speech utterance. For that features like pitch, log energy, zero crossing rate, and first three formant frequencies are used. Feature vectors are constructed using the 11 statistical parameters of each feature. The Artificial Neural Network (ANN) is chosen as a classifier owing to its universal function approximation capabilities. In ANN based classifier, the time required for training the network as well as for classification depends upon the dimension of feature vector. This work focused on development of a speaker emotion recognition system using prosody features as well as reduction of dimensionality of feature vectors. Here, principle component analysis (PCA) is used for feature vector dimensionality reduction. Emotional prosody speech and transcription from Linguistic Data Consortium (LDC) and Berlin emotional databases are considered for evaluating the performance of proposed approach for seven types of emotion recognition. The performance of the proposed method is compared with existing approaches and better performance is obtained with proposed method. From experimental results it is observed that 75.32% and 84.5% recognition rate is obtained for Berlin emotional database and LDC emotional speech database respectively.
引用
收藏
页码:521 / 531
页数:11
相关论文
共 50 条
  • [1] Improving the performance of the speaker emotion recognition based on low dimension prosody features vector
    Ashishkumar Prabhakar Gudmalwar
    Ch V Rama Rao
    Anirban Dutta
    International Journal of Speech Technology, 2019, 22 : 521 - 531
  • [2] Prosody based emotion recognition for MEXI
    Austermann, A
    Esau, N
    Kleinjohann, L
    Kleinjohann, B
    2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 2430 - 2436
  • [3] Speaker-Independent Emotion Recognition based on Feature Vector Classification
    Park, Jeong-Sik
    Kim, Ji-Hwan
    Yoon, Sang-Min
    Oh, Yung-Hwan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2775 - +
  • [4] Performance Comparison of Speaker and Emotion Recognition
    Revathy, A.
    Shanmugapriya, P.
    Mohan, V.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [5] Speaker Recognition using Spectral Dimension Features
    Chen, Wen-Shiung
    Huang, Jr-Feng
    2009 FOURTH INTERNATIONAL MULTI-CONFERENCE ON COMPUTING IN THE GLOBAL INFORMATION TECHNOLOGY (ICCGI 2009), 2009, : 132 - 137
  • [6] Variational autoencoder for prosody-based speaker recognition
    Ben Alex, Starlet
    Mary, Leena
    ETRI JOURNAL, 2023, 45 (04) : 678 - 689
  • [7] Improving Speech Emotion Recognition System for a Social Robot with Speaker Recognition
    Juszkiewicz, Lukasz
    2014 19TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2014, : 921 - 925
  • [8] Speaker Recognition and Speech Emotion Recognition Based on GMM
    Xu, Shupeng
    Liu, Yan
    Liu, Xiping
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
  • [9] Improving speaker recognition by training on emotion-added models
    Wu, T
    Yang, YC
    Wu, ZH
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 382 - 389
  • [10] Differenced Prosody Features from Normal and Stressed Regions for Emotion Recognition
    Raju, Vishnu Vidyadhara V.
    Gurugubelli, Krishna
    Alluri, K. N. R. K. Raju
    Vuppala, Anil Kumar
    2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 821 - 825