Continuous Emotion Recognition in Speech - Do We Need Recurrence?

被引:22
|
作者
Schmitt, Maximilian [1 ]
Cummins, Nicholas [1 ]
Schuller, Bjoern [1 ,2 ]
机构
[1] Univ Augsburg, ZD B Chair Embedded Intelligence Hlth Care & Well, Augsburg, Germany
[2] Imperial Coll London, GLAM Grp Language Audio & Mus, London, England
来源
关键词
affective computing; speech emotion recognition; human-computer interaction; computational paralinguistics; convolutional neural networks;
D O I
10.21437/Interspeech.2019-2710
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Emotion recognition in speech is a meaningful task in affective computing and human-computer interaction. As human emotion is a frequently changing state, it is usually represented as a densely sampled time series of emotional dimensions, typically arousal and valence. For this, recurrent neural network (RNN) architectures are employed by default when it comes to modelling the contours with deep learning approaches. However, the amount of temporal context required is questionable, and it has not yet been clarified whether the consideration of long-term dependencies is actually beneficial. In this contribution, we demonstrate that RNNs are not necessary to accomplish the task of time-continuous emotion recognition. Indeed, results gained indicate that deep neural networks incorporating less complex convolutional layers can provide more accurate models. We highlight the pros and cons of recurrent and non-recurrent approaches and evaluate our methods on the public SEWA database, which was used as a benchmark in the 2017 and 2018 editions of the Audio-Visual Emotion Challenge.
引用
收藏
页码:2808 / 2812
页数:5
相关论文
共 50 条
  • [1] Integrating Recurrence Dynamics for Speech Emotion Recognition
    Tzinis, Efthymios
    Paraskevopoulos, Georgios
    Baziotis, Christos
    Potamianos, Alexandros
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 927 - 931
  • [2] Continuous Wavelet Transform based Speech Emotion Recognition
    Shegokar, Pankaj
    Sircar, Pradip
    [J]. 2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [3] Continuous speech emotion recognition with convolutional neural networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    [J]. AES: Journal of the Audio Engineering Society, 2020, 68 (1-2): : 14 - 24
  • [4] Continuous Speech Emotion Recognition with Convolutional Neural Networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2): : 14 - 24
  • [5] PREDICTION-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH
    Han, Jing
    Zhang, Zixing
    Ringeval, Fabien
    Schuller, Bjorn
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5005 - 5009
  • [6] Do We Really Need Continuous Glucose Monitoring?
    Peters, Anne
    [J]. DIABETES TECHNOLOGY & THERAPEUTICS, 2009, 11 : S128 - S130
  • [7] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,
  • [8] Speech perception in VR: do we need individual recordings?
    Reyes-Lecuona, Arcadio
    Cuevas-Rodriguez, Maria
    Gonzalez-Toledo, Daniel
    Molina-Tanco, Luis
    Picinali, Lorenzo
    [J]. 2021 IMMERSIVE AND 3D AUDIO: FROM ARCHITECTURE TO AUTOMOTIVE (I3DA), 2021,
  • [9] RECONSTRUCTION-ERROR-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH
    Han, Jing
    Zhang, Zixing
    Ringeval, Fabien
    Schuller, Bjoern
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2367 - 2371
  • [10] Do we need to invent another emotion? The role of caring in attachment
    Youngstrom, E
    [J]. PSYCHOLOGICAL INQUIRY, 2000, 11 (02) : 103 - 107