Significance of Phonological Features in Speech Emotion Recognition

被引:7
|
作者
Wang, Wei [1 ]
Watters, Paul A. [2 ]
Cao, Xinyi [1 ]
Shen, Lingjie [1 ]
Li, Bo [3 ]
机构
[1] Nanjing Normal Univ, Sch Educ Sci, Nanjing 210097, JS, Peoples R China
[2] La Trobe Univ, Dept Comp Sci & Informat Technol, Melbourne, Vic 3350, Australia
[3] Univ Southern Mississippi, Sch Comp Sci & Comp Engn, 730 East Beach Blvd, Long Beach, MS 39560 USA
基金
中国国家自然科学基金;
关键词
Speech emotion recognition; Phonological features; Feature analysis; Acoustic features;
D O I
10.1007/s10772-020-09734-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel Speech Emotion Recognition (SER) method based on phonological features is proposed in this paper. Intuitively, as expert knowledge derived from linguistics, phonological features are correlated with emotions. However, it has been found that they are seldomly used as features to improve SER. Motivated by this, we set our goal to utilize phonological features to further advance SER's accuracy since they can provide complementary information for the task. Furthermore, we will also explore the relationship between phonological features and emotions. Firstly, instead of only based on acoustic features, we devise a new SER approach by fusing phonological representations and acoustic features together. A significant improvement in SER performance has been demonstrated on a publicly available SER database named Interactive Emotional Dyadic Motion Capture (IEMOCAP). Secondly, the experimental results show that the top-performing method for the task of categorical emotion recognition is a deep learning-based classifier which generates an unweighted average recall (UAR) accuracy of 60.02%. Finally, we investigate the most discriminative features and find some patterns of emotional rhyme based on the phonological representations.
引用
下载
收藏
页码:633 / 642
页数:10
相关论文
共 50 条
  • [1] Significance of Phonological Features in Speech Emotion Recognition
    Wei Wang
    Paul A. Watters
    Xinyi Cao
    Lingjie Shen
    Bo Li
    International Journal of Speech Technology, 2020, 23 : 633 - 642
  • [2] Speech Recognition with Phonological Features: Some issues to attend
    Stouten, Frederik
    Martens, Jean-Pierre
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 357 - 360
  • [3] Integrating Language and Emotion Features for Multilingual Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. MULTIMODAL AND NATURAL INTERACTION, HCI 2020, PT II, 2020, 12182 : 187 - 196
  • [4] Novel acoustic features for speech emotion recognition
    ROH Yong-Wan
    KIM Dong-Ju
    LEE Woo-Seok
    HONG Kwang-Seok
    Science China Technological Sciences, 2009, 52 (07) : 1838 - 1848
  • [5] Exploiting the potentialities of features for speech emotion recognition
    Li, Dongdong
    Zhou, Yijun
    Wang, Zhe
    Gao, Daqi
    INFORMATION SCIENCES, 2021, 548 : 328 - 343
  • [6] Learning Transferable Features for Speech Emotion Recognition
    Marczewski, Alison
    Veloso, Adriano
    Ziviani, Nivio
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 529 - 536
  • [7] Applying articulatory features to speech emotion recognition
    Zhou, Yu
    Sun, Yanqing
    Yang, Lin
    Yan, Yonghong
    2009 INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN COMPUTER SCIENCE, ICRCCS 2009, 2009, : 73 - 76
  • [8] Novel acoustic features for speech emotion recognition
    Yong-Wan Roh
    Dong-Ju Kim
    Woo-Seok Lee
    Kwang-Seok Hong
    Science in China Series E: Technological Sciences, 2009, 52 : 1838 - 1848
  • [9] Speech Emotion Recognition using Combination of Features
    Zhang, Qingli
    An, Ning
    Wang, Kunxia
    Ren, Fuji
    Li, Lian
    PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 523 - 528
  • [10] Adding dimensional features for emotion recognition on speech
    Ben Letaifa, Leila
    Ines Torres, Maria
    Justo, Raquel
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,