Emotion recognition and evaluation from Mandarin speech signals

Cited by: 0
Authors
Pao, Tsanglong [1]
Chen, Yute [1]
Yeh, Junheng [1]
Affiliations
[1] Tatung Univ, Dept Comp Sci & Engn, Taipei 104, Taiwan
Keywords
emotional speech recognition and evaluation; radar chart; weighted D-KNN classifier; McNemar's test
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The exploration of how human beings react to the world and interact with it and each other remains one of the greatest scientific challenges. The ability to recognize the affective state of the person we face is the core of emotional intelligence. In the past, several classifiers were adopted independently and tested on emotional speech corpora that differ in language, size, number of emotional states and recording method, which makes it difficult to compare and evaluate the performance of those classifiers. In this paper, we implemented a weighted discrete k-nearest neighbor (weighted D-KNN) classification algorithm and compared it with the KNN, M-KNN and SVM classification methods by applying them to a Mandarin speech corpus. This speech corpus covers five basic emotions: anger, happiness, boredom, sadness and neutral. The experimental results and McNemar's test showed that the implemented weighted D-KNN method performed best among these classifiers, achieving an accuracy of 81.4%. In addition, we implemented an emotion radar chart, based on weighted D-KNN, that presents the intensity of each emotion component of the speech in our emotion evaluation system. Such a system can be further used in speech training, especially to help hearing-impaired people learn to express emotions in speech more naturally.
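The abstract relies on McNemar's test to compare classifiers evaluated on the same corpus. As a rough illustration of what such a comparison involves, the sketch below applies the standard continuity-corrected McNemar statistic to the predictions of two classifiers on the same test items. It is not the authors' code: the function name `mcnemar_test`, the variable names, and the toy labels in the usage note are all assumptions made for this example.

```python
import math


def mcnemar_test(y_true, pred_a, pred_b):
    """Compare two classifiers on the same test items with McNemar's test.

    Returns (n_a_only, n_b_only, statistic, p_value), where n_a_only counts
    items only classifier A got right and n_b_only counts items only
    classifier B got right. Uses the continuity-corrected chi-square form
    with one degree of freedom.
    """
    n_a_only = 0
    n_b_only = 0
    for truth, a, b in zip(y_true, pred_a, pred_b):
        a_ok = a == truth
        b_ok = b == truth
        if a_ok and not b_ok:
            n_a_only += 1
        elif b_ok and not a_ok:
            n_b_only += 1

    discordant = n_a_only + n_b_only
    if discordant == 0:
        # The classifiers never disagree in correctness: no evidence of a difference.
        return n_a_only, n_b_only, 0.0, 1.0

    statistic = (abs(n_a_only - n_b_only) - 1) ** 2 / discordant
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2))
    return n_a_only, n_b_only, statistic, p_value
```

A hypothetical call such as `mcnemar_test(labels, preds_dknn, preds_svm)` takes three equal-length sequences of emotion labels (for example "anger" or "neutral"). A small p-value suggests that the two classifiers' error rates differ by more than chance. With few discordant items, an exact binomial version of the test is usually preferred over this chi-square approximation.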
Pages: 1695-1709
Page count: 15
Related papers
50 in total
  • [1] Mandarin emotion recognition in speech
    Pao, TL
    Chen, YT
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 227 - 230
  • [2] Emotion recognition from Mandarin speech signals
    Pao, TL
    Chen, YT
    Yeh, JH
    [J]. 2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 301 - 304
  • [3] Research on Mandarin Chinese in Speech Emotion Recognition
    Wang, Ziyun
    Guo, Xiao
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 99 - 103
  • [4] EmoEars: An emotion recognition system for mandarin speech
    Xie, B
    Chen, L
    Chen, GC
    Chen, C
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 941 - 948
  • [5] Automatic Emotion Recognition of Speech Signal in Mandarin
    Zhang, Sheng
    Ching, P. C.
    Kong, Fanrang
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1810 - +
  • [6] Improving Automatic Emotion Recognition from Speech Signals
    Bozkurt, Elif
    Erzin, Engin
    Erdem, Cigdem Eroglu
    Erdem, A. Tanju
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +
  • [7] Comparison of several classifiers for emotion recognition from noisy mandarin speech
    Pao, Tsang-Long
    Liao, Wen-Yuan
    Chen, Yu-Te
    Yeh, Jun-Heng
    Cheng, Yun-Maw
    Chien, Charles S.
    [J]. 2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2007, : 23 - +
  • [8] Emotion Recognition from Noisy Mandarin Speech Preprocessed by Compressed Sensing
    Jiang, Xiaoqing
    He, Dapeng
    Yang, Xinghai
    Wang, Lingyin
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT II, 2017, 10362 : 626 - 636
  • [9] Recognition and Analysis of Emotion Transition in Mandarin Speech Signal
    Pao, Tsang-Long
    Yeh, Jun-Heng
    Tsai, Yao-Wei
    [J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3326 - 3332
  • [10] Statistical feature selection for mandarin speech emotion recognition
    Xie, B
    Chen, L
    Chen, GC
    Chen, C
    [J]. ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 591 - 600