A Study on the Search of the Most Discriminative Speech Features in the Speaker Dependent Speech Emotion Recognition

被引:6
|
作者
Pao, Tsang-Long [1 ]
Wang, Chun-Hsiang [1 ]
Li, Yu-Ji [1 ]
机构
[1] Tatung Univ, Dept Comp Sci & Engn, Taipei 104, Taiwan
关键词
Speech Emotion Recognition; Speech Feature Selection; WD-KNN Classifier; GMM Classifier; IDENTIFICATION;
D O I
10.1109/PAAP.2012.31
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Expressing emotion to others and recognizing emotion state of the counterpart are not difficult for human. Emotion state of a person may be recognized from the facial expression, voice, and/or gesture. Speech emotion recognition research gained a lot of attention in recent years. One of the important subjects in speech emotion recognition research is the feature selection. The speech features used will greatly influence the recognition rate. In this research, we try to find the most discriminative features for emotion recognition out from a set of 78 features. We use these features to study the feature characteristics for individual speaker by using a GMM classifier. We obtained an average of 71% recognition rate in speaker dependent case while an average of 48% recognition rate in speaker independent case.
引用
收藏
页码:157 / 162
页数:6
相关论文
共 50 条
  • [21] AUTOMATED SPEECH RECOGNITION SYSTEM FOR SPEAKER EMOTION CLASSIFICATION
    Anithadevi, N.
    Gokul, P.
    Nandan, S. Muhil
    Magesh, R.
    Shiddharth, S.
    PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [22] Learning Discriminative Features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition
    Tripathi, Suraj
    Ramesh, Abhiram
    Kumar, Abhay
    Singh, Chirag
    Yenigalla, Promod
    WORKSHOP ON ARTIFICIAL INTELLIGENCE IN AFFECTIVE COMPUTING, VOL 122, 2019, 122 : 44 - 53
  • [23] LEARNING DISCRIMINATIVE FEATURES FROM SPECTROGRAMS USING CENTER LOSS FOR SPEECH EMOTION RECOGNITION
    Dai, Dongyang
    Wu, Zhiyong
    Li, Runnan
    Wu, Xixin
    Jia, Jia
    Meng, Helen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7405 - 7409
  • [24] Discriminative speaker adaptation in Persian continuous speech recognition systems
    Pirhosseinloo, Shadi
    Ganj, Farshad Almas
    4TH INTERNATIONAL CONFERENCE OF COGNITIVE SCIENCE, 2012, 32 : 296 - 301
  • [25] Learning emotion-discriminative and domain-invariant features for domain adaptation in speech emotion recognition
    Mao, Qirong
    Xu, Guopeng
    Xue, Wentao
    Gou, Jianping
    Zhan, Yongzhao
    SPEECH COMMUNICATION, 2017, 93 : 1 - 10
  • [26] Speaker-independent speech emotion recognition by fusion of functional and accompanying paralanguage features
    Qi-rong Mao
    Xiao-lei Zhao
    Zheng-wei Huang
    Yong-zhao Zhan
    Journal of Zhejiang University SCIENCE C, 2013, 14 : 573 - 582
  • [27] On the relevance of high-level features for speaker independent emotion recognition of spontaneous speech
    Lugger, Marko
    Yang, Bin
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1959 - 1962
  • [28] Speaker-independent speech emotion recognition by fusion of functional and accompanying paralanguage features
    Qi-rong MAO
    Xiao-lei ZHAO
    Zheng-wei HUANG
    Yong-zhao ZHAN
    Frontiers of Information Technology & Electronic Engineering, 2013, 14 (07) : 573 - 582
  • [29] Speaker-independent speech emotion recognition by fusion of functional and accompanying paralanguage features
    Mao, Qi-rong
    Zhao, Xiao-lei
    Huang, Zheng-wei
    Zhan, Yong-zhao
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2013, 14 (07): : 573 - 582
  • [30] Speaker Dependent, Speaker Independent and Cross Language Emotion Recognition From Speech Using GMM and HMM
    Bhaykar, Manav
    Yadav, Jainath
    Rao, K. Sreenivasa
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,