Recognition of Emotional States in Natural Speech

被引:0
|
作者
Kaminska, Dorota [1 ]
Sapinski, Tomasz [1 ]
Pelikant, Adam [1 ]
机构
[1] Tech Univ Lodz, Inst Mechatron & Informat Syst, PL-90924 Lodz, Poland
关键词
EVOLUTION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Research in emotional speech recognition is generally focused on analysis of a set of primary emotions. However, it is clear that spontaneous speech, which is more intricate comparing to acted out utterances, carries information about emotional complexity or degree of their intensity. This research refers to the theory of Robert Plutchik, who suggested the existence of eight primary emotions. All other states are derivatives and occur as combinations, mixtures or compounds of the primary emotions. In this paper authors presents a novel approach for automatic speech clustering into subgroups representing primary emotions intensities. For this purpose a multimodal classifier based on k-means algorithm was implemented and tested on Polish spontaneous speech database. Studies have been conducted using prosodic features and perceptual coefficients. Results have shown that the proposed measure is effective in recognition of intensity of the predicted emotion.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Building a Recognition System of Speech Emotion and Emotional States
    Feng, Xiaoyan
    Watada, Junzo
    [J]. 2013 SECOND INTERNATIONAL CONFERENCE ON ROBOT, VISION AND SIGNAL PROCESSING (RVSP), 2013, : 253 - 258
  • [2] Recognition of Emotional states in Natural Human-Computer Interaction
    Milanova, Mariofanna
    Sirakov, Nikolay
    [J]. ISSPIT: 8TH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2008, : 186 - +
  • [3] Influence of Emotional Speech on Continuous Speech Recognition
    Zgank, Andrej
    Maucec, Mirjam Sepesy
    [J]. 13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
  • [4] Smart Recognition and Synthesis of Emotional Speech for Embedded Systems with Natural User Interfaces
    Malcangi, M.
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 867 - 871
  • [5] Prediction of emotional dimensions PAD for emotional speech recognition
    Sun, Ying
    Hu, Yan-Xiang
    Zhang, Xue-Ying
    Duan, Shu-Fei
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (10): : 2041 - 2048
  • [6] Deep Learning for Emotional Speech Recognition
    Sanchez-Gutierrez, Maximo E.
    Marcelo Albornoz, E.
    Martinez-Licona, Fabiola
    Leonardo Rufiner, H.
    Goddard, John
    [J]. PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
  • [7] Dimensionality Reduction for Emotional Speech Recognition
    Fewzee, Pouria
    Karray, Fakhri
    [J]. PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012), 2012, : 532 - 537
  • [8] Deep Learning for Emotional Speech Recognition
    Alhamada, M., I
    Khalifa, O. O.
    Abdalla, A. H.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA2020), 2020, 2306
  • [9] New Features for Emotional Speech Recognition
    Palo, Hemanta Kumar
    Mohanty, Mihir Narayan
    Chandra, Mahesh
    [J]. 2015 IEEE POWER, COMMUNICATION AND INFORMATION TECHNOLOGY CONFERENCE (PCITC-2015), 2015, : 424 - 429
  • [10] Emotional Speech Recognition: A Multilingual Perspective
    Meftah, Ali
    Alotaibi, Yousef
    Selouani, Sid-Ahmed
    [J]. 2016 INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART), 2016,