A Novel Emotion Recognizer from Speech Using Both Prosodic and Linguistic Features

被引:0
|
作者
Suzuki, Motoyuki [1 ]
Tsuchiya, Seiji [2 ]
Ren, Fuji [1 ]
机构
[1] Univ Tokushima, Inst Sci & Technol, 2-1 Minamijosanjima Cho, Tokushima 7708506, Japan
[2] Doshisha Univ, Dept Intelligent Informat Engn, Kyoto 6100394, Japan
关键词
Emotion recognition; prosodic feature; linguistic feature; association mechanism;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition based on speech characteristics generally relies on prosodic information. However, utterances with different emotions in speech have similar prosodic features, so it is difficult to recognize emotion by using only prosodic features. In this paper, we propose a novel approach to emotion recognition that considers both prosodic and linguistic features. First, possible emotions are output by clustering-based emotion recognizer, which only uses prosodic features. Then, subtitles given by the speech recognizer are input for another emotion recognizer based on the "Association Mechanism." It outputs a possible emotion by using only linguistic information. Lastly, the intersection of the two sets of possible emotions is integrated into the final result. Experimental results showed that the proposed method achieved higher performance than either prosodic-or linguistic-based emotion recognition. In a comparison with manually labeled data, the F-measure was 32.6%. On the other hand, the average of F-measures of labeled data given by other humans was 42.9%. This means that the proposed method performed at 75.9% in relation to human ability.
引用
收藏
页码:456 / 465
页数:10
相关论文
共 50 条
  • [31] Novel acoustic features for speech emotion recognition
    Roh Yong-Wan
    Kim Dong-Ju
    Lee Woo-Seok
    Hong Kwang-Seok
    SCIENCE IN CHINA SERIES E-TECHNOLOGICAL SCIENCES, 2009, 52 (07): : 1838 - 1848
  • [32] Integrated System for Prosodic Features Detection from Speech
    Zbancioc, Marius Dan
    Feraru, Monica
    2014 INTERNATIONAL CONFERENCE AND EXPOSITION ON ELECTRICAL AND POWER ENGINEERING (EPE), 2014, : 114 - 117
  • [33] Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features
    Starlet Ben Alex
    Leena Mary
    Ben P. Babu
    Circuits, Systems, and Signal Processing, 2020, 39 : 5681 - 5709
  • [34] Evaluation of linguistic and prosodic features for detection of Alzheimer’s disease in Turkish conversational speech
    Ali Khodabakhsh
    Fatih Yesil
    Ekrem Guner
    Cenk Demiroglu
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [35] Recognizing emotion from Turkish speech using acoustic features
    Oflazoglu, Caglar
    Yildirim, Serdar
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [36] Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features
    Ben Alex, Starlet
    Mary, Leena
    Babu, Ben P.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (11) : 5681 - 5709
  • [37] Recognizing emotion from Turkish speech using acoustic features
    Caglar Oflazoglu
    Serdar Yildirim
    EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [38] Evaluation of linguistic and prosodic features for detection of Alzheimer's disease in Turkish conversational speech
    Khodabakhsh, Ali
    Yesil, Fatih
    Guner, Ekrem
    Demiroglu, Cenk
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [39] Speech Emotion Recognition by Late Fusion of Linguistic and Acoustic Features using Deep Learning Models
    Sato, Kiyohide
    Kishi, Keita
    Kosaka, Tetsuo
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1013 - 1018
  • [40] Emotional head motion predicting from prosodic and linguistic features
    Minghao Yang
    Jinlin Jiang
    Jianhua Tao
    Kaihui Mu
    Hao Li
    Multimedia Tools and Applications, 2016, 75 : 5125 - 5146