A Study on the Search of the Most Discriminative Speech Features in the Speaker Dependent Speech Emotion Recognition

被引:6
|
作者
Pao, Tsang-Long [1 ]
Wang, Chun-Hsiang [1 ]
Li, Yu-Ji [1 ]
机构
[1] Tatung Univ, Dept Comp Sci & Engn, Taipei 104, Taiwan
关键词
Speech Emotion Recognition; Speech Feature Selection; WD-KNN Classifier; GMM Classifier; IDENTIFICATION;
D O I
10.1109/PAAP.2012.31
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Expressing emotion to others and recognizing emotion state of the counterpart are not difficult for human. Emotion state of a person may be recognized from the facial expression, voice, and/or gesture. Speech emotion recognition research gained a lot of attention in recent years. One of the important subjects in speech emotion recognition research is the feature selection. The speech features used will greatly influence the recognition rate. In this research, we try to find the most discriminative features for emotion recognition out from a set of 78 features. We use these features to study the feature characteristics for individual speaker by using a GMM classifier. We obtained an average of 71% recognition rate in speaker dependent case while an average of 48% recognition rate in speaker independent case.
引用
收藏
页码:157 / 162
页数:6
相关论文
共 50 条
  • [1] Speaker Attentive Speech Emotion Recognition
    Le Moine, Clement
    Obin, Nicolas
    Roebel, Axel
    INTERSPEECH 2021, 2021, : 2866 - 2870
  • [2] Speaker Awareness for Speech Emotion Recognition
    Assuncao, Gustavo
    Menezes, Paulo
    Perdigao, Fernando
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (04) : 15 - 22
  • [3] Speech Emotion Recognition Based on Transfer Emotion-Discriminative Features Subspace Learning
    Zhang, Kexin
    Liu, Yunxiang
    IEEE ACCESS, 2023, 11 : 56336 - 56343
  • [4] Speech Emotion Recognition with Discriminative Feature Learning
    Zhou, Huan
    Liu, Kai
    INTERSPEECH 2020, 2020, : 4094 - 4097
  • [5] Discriminative Feature Learning for Speech Emotion Recognition
    Zhang, Yuying
    Zou, Yuexian
    Peng, Junyi
    Luo, Danqing
    Huang, Dongyan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 198 - 210
  • [6] Speaker Recognition and Speech Emotion Recognition Based on GMM
    Xu, Shupeng
    Liu, Yan
    Liu, Xiping
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
  • [7] An evaluation of visual speech features for the tasks of speech and speaker recognition
    Lucey, S
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 260 - 267
  • [8] Speech Databases, Speech Features, and Classifiers in Speech Emotion Recognition: A Review
    Mohmad Dar, G.H.
    Delhibabu, Radhakrishnan
    IEEE Access, 2024, 12 : 151122 - 151152
  • [9] Speaker Dependent Speech Emotion Recognition using MFCC and Support Vector Machine
    Dahake, Prajakta P.
    Shaw, Kailash
    Malathi, P.
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 1080 - 1084
  • [10] Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition
    Romanenko, Aleksei
    Mendelev, Valentin
    SPEECH AND COMPUTER, 2016, 9811 : 620 - 626