Towards more reality in the recognition of emotional speech

被引:0
|
作者
Schuller, Bjoern [1 ]
Seppi, Dino [2 ]
Batliner, Anton [3 ]
Maier, Andreas [3 ]
Steidl, Stefan [3 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-8000 Munich, Germany
[2] IRST, ITC, Trento, Italy
[3] Univ Erlangen Nurnberg, Lehrstuhl Mustererkennung, Nurnberg, Germany
关键词
emotion recognition; affective computing; noise robustness; spontaneous emotions;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
As automatic emotion recognition based on speech matures, new challenges can be faced. We therefore address the major aspects in view of potential applications in the field, to benchmark to ay's emotion recognition systems and bridge the gap between commercial interest and current performances: acted vs. spontaneous speech, realistic emotions, noise and microphone conditions, and speaker independence. Three different data-sets are used: the Berlin Emotional Speech Database, the Danish Emotional Speech Database, and the spontaneous AIBO Emotion Corpus. By using different feature types such as word- or turn-based statistics, manual versus forced alignment, and optimization techniques we show how to best cope with this demanding task and how noise addition or different microphone positions affect emotion recognition.
引用
收藏
页码:941 / +
页数:2
相关论文
共 50 条
  • [1] Influence of Emotional Speech on Continuous Speech Recognition
    Zgank, Andrej
    Maucec, Mirjam Sepesy
    [J]. 13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
  • [2] Prediction of emotional dimensions PAD for emotional speech recognition
    Sun, Ying
    Hu, Yan-Xiang
    Zhang, Xue-Ying
    Duan, Shu-Fei
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (10): : 2041 - 2048
  • [3] MORE ON SPEECH RECOGNITION SYSTEMS
    ZIER, MA
    [J]. M D COMPUTING, 1991, 8 (02): : 72 - 72
  • [4] Recognition of Emotional States in Natural Speech
    Kaminska, Dorota
    Sapinski, Tomasz
    Pelikant, Adam
    [J]. 2013 SIGNAL PROCESSING SYMPOSIUM (SPS), 2013,
  • [5] Deep Learning for Emotional Speech Recognition
    Sanchez-Gutierrez, Maximo E.
    Marcelo Albornoz, E.
    Martinez-Licona, Fabiola
    Leonardo Rufiner, H.
    Goddard, John
    [J]. PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
  • [6] Dimensionality Reduction for Emotional Speech Recognition
    Fewzee, Pouria
    Karray, Fakhri
    [J]. PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012), 2012, : 532 - 537
  • [7] Deep Learning for Emotional Speech Recognition
    Alhamada, M., I
    Khalifa, O. O.
    Abdalla, A. H.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA2020), 2020, 2306
  • [8] New Features for Emotional Speech Recognition
    Palo, Hemanta Kumar
    Mohanty, Mihir Narayan
    Chandra, Mahesh
    [J]. 2015 IEEE POWER, COMMUNICATION AND INFORMATION TECHNOLOGY CONFERENCE (PCITC-2015), 2015, : 424 - 429
  • [9] Emotional Speech Recognition: A Multilingual Perspective
    Meftah, Ali
    Alotaibi, Yousef
    Selouani, Sid-Ahmed
    [J]. 2016 INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART), 2016,
  • [10] Improved Emotional Speech Recognition Algorithms
    Rajeswari, A.
    Sowmbika, P.
    Kalaimagal, P.
    Ramya, M.
    Ranjitha, M.
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2362 - 2366