Towards more reality in the recognition of emotional speech

被引:0
|
作者
Schuller, Bjoern [1 ]
Seppi, Dino [2 ]
Batliner, Anton [3 ]
Maier, Andreas [3 ]
Steidl, Stefan [3 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-8000 Munich, Germany
[2] IRST, ITC, Trento, Italy
[3] Univ Erlangen Nurnberg, Lehrstuhl Mustererkennung, Nurnberg, Germany
关键词
emotion recognition; affective computing; noise robustness; spontaneous emotions;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
As automatic emotion recognition based on speech matures, new challenges can be faced. We therefore address the major aspects in view of potential applications in the field, to benchmark to ay's emotion recognition systems and bridge the gap between commercial interest and current performances: acted vs. spontaneous speech, realistic emotions, noise and microphone conditions, and speaker independence. Three different data-sets are used: the Berlin Emotional Speech Database, the Danish Emotional Speech Database, and the spontaneous AIBO Emotion Corpus. By using different feature types such as word- or turn-based statistics, manual versus forced alignment, and optimization techniques we show how to best cope with this demanding task and how noise addition or different microphone positions affect emotion recognition.
引用
收藏
页码:941 / +
页数:2
相关论文
共 50 条
  • [31] Emotional speech: Towards a new generation of databases
    Douglas-Cowie, E
    Campbell, N
    Cowie, R
    Roach, P
    [J]. SPEECH COMMUNICATION, 2003, 40 (1-2) : 33 - 60
  • [32] Facial emotional recognition in schizophrenia: preliminary results of the Virtual Reality Program for Facial Emotional Recognition
    Souto, Teresa
    Baptista, Alexandre
    Tavares, Diana
    Queiros, Cristina
    Marques, Antonio
    [J]. REVISTA DE PSIQUIATRIA CLINICA, 2013, 40 (04): : 129 - 134
  • [33] Towards inclusive automatic speech recognition
    Feng, Siyuan
    Halpern, Bence Mark
    Kudina, Olya
    Scharenborg, Odette
    [J]. COMPUTER SPEECH AND LANGUAGE, 2024, 84
  • [34] Towards speech recognition oriented dereverberation
    Jinachitra, P
    Prieto, RE
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 437 - 440
  • [35] Towards hierarchical speech recognition systems
    Elison, J
    Zhang, Y
    Yfantis, EA
    [J]. PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 64 - 68
  • [36] Towards automatic recognition of emotion in speech
    Razak, AA
    Yusof, MHM
    Komiya, R
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 548 - 551
  • [37] Emotional Intelligence, Not Music Training, Predicts Recognition of Emotional Speech Prosody
    Trimmer, Christopher G.
    Cuddy, Lola L.
    [J]. EMOTION, 2008, 8 (06) : 838 - 849
  • [38] Acoustic Model Adaptation for Emotional Speech Recognition Using Twitter-Based Emotional Speech Corpus
    Kosaka, Tetsuo
    Aizawa, Yoshitaka
    Kato, Masaharu
    Nose, Takashi
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1747 - 1751
  • [39] Emotional experiences in schizophrenia:: Towards more clarity?
    Antonius, D.
    Tremeau, F.
    Ziwich, R.
    Jalbrizkowski, M.
    Silipo, G.
    Butler, P. D.
    Javitt, D. C.
    [J]. SCHIZOPHRENIA BULLETIN, 2007, 33 (02) : 581 - 581
  • [40] Deep Learning Analysis Models for Speech and Emotional Recognition
    Wu, Jun
    Zhu, Tianliang
    Yu, Chengtian
    Wang, Chunzhi
    Zhou, Xianjing
    Liu, Hu
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1541 - 1545