A Novel Emotion Recognizer from Speech Using Both Prosodic and Linguistic Features

被引:0
|
作者
Suzuki, Motoyuki [1 ]
Tsuchiya, Seiji [2 ]
Ren, Fuji [1 ]
机构
[1] Univ Tokushima, Inst Sci & Technol, 2-1 Minamijosanjima Cho, Tokushima 7708506, Japan
[2] Doshisha Univ, Dept Intelligent Informat Engn, Kyoto 6100394, Japan
关键词
Emotion recognition; prosodic feature; linguistic feature; association mechanism;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition based on speech characteristics generally relies on prosodic information. However, utterances with different emotions in speech have similar prosodic features, so it is difficult to recognize emotion by using only prosodic features. In this paper, we propose a novel approach to emotion recognition that considers both prosodic and linguistic features. First, possible emotions are output by clustering-based emotion recognizer, which only uses prosodic features. Then, subtitles given by the speech recognizer are input for another emotion recognizer based on the "Association Mechanism." It outputs a possible emotion by using only linguistic information. Lastly, the intersection of the two sets of possible emotions is integrated into the final result. Experimental results showed that the proposed method achieved higher performance than either prosodic-or linguistic-based emotion recognition. In a comparison with manually labeled data, the F-measure was 32.6%. On the other hand, the average of F-measures of labeled data given by other humans was 42.9%. This means that the proposed method performed at 75.9% in relation to human ability.
引用
收藏
页码:456 / 465
页数:10
相关论文
共 50 条
  • [41] Emotional head motion predicting from prosodic and linguistic features
    Yang, Minghao
    Jiang, Jinlin
    Tao, Jianhua
    Mu, Kaihui
    Li, Hao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (09) : 5125 - 5146
  • [42] HMM-based speech recognizer using overlapping articulatory features
    Erler, Kevin
    Freeman, George H.
    Journal of the Acoustical Society of America, 1996, 100 (4 pt 1):
  • [43] An HMM-based speech recognizer using overlapping articulatory features
    Erler, K
    Freeman, GH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (04): : 2500 - 2513
  • [44] ACCENT DETECTION OF TELUGU SPEECH USING PROSODIC AND FORMANT FEATURES
    Mannepalli, Kasiprasad
    Sastry, P. Nrahari
    Rajesh, V.
    2015 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION ENGINEERING SYSTEMS (SPACES), 2015, : 318 - 322
  • [45] Dialect Identification from Assamese Speech using Prosodic Features and a Neuro Fuzzy Classifier
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    2016 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2016, : 127 - 132
  • [46] Speech Emotion Recognition Using Novel HHT-TEO Based Features
    Xiang, Li
    Xin, Li
    JOURNAL OF COMPUTERS, 2011, 6 (05) : 989 - 998
  • [47] Speech Emotion Recognition using Combination of Features
    Zhang, Qingli
    An, Ning
    Wang, Kunxia
    Ren, Fuji
    Li, Lian
    PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 523 - 528
  • [48] EMOTION CLASSIFICATION OF SPEECH USING MODULATION FEATURES
    Chaspari, Theodora
    Dimitriadis, Dimitrios
    Maragos, Petros
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1552 - 1556
  • [49] Speech Emotion Classification using Acoustic Features
    Chen, Shizhe
    Jin, Qin
    Li, Xirong
    Yang, Gang
    Xu, Jieping
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
  • [50] Emotion recognition from telephone speech using acoustic and nonlinear features
    Bedoya-Jaramillo, S.
    Orozco-Arroyave, J. R.
    Arias-Londono, J. D.
    Vargas-Bonilla, J. F.
    2013 47TH INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2013,