END-TO-END LEARNING FOR DIMENSIONAL EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS

被引:0
|
作者
Keren, Gil [1 ]
Kirschstein, Tobias [1 ]
Marchi, Erik [1 ,2 ]
Ringeval, Fabien [1 ,3 ]
Schuller, Bjoern [1 ,4 ]
机构
[1] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany
[2] Apple Inc, Cupertino, CA 95014 USA
[3] Univ Grenoble Alpes, Lab Informat Grenoble, Grenoble, France
[4] Imperial Coll London, Dept Comp, London, England
关键词
End-to-end learning; Physiological signals; Emotion recognition; Convolutional Neural Networks; Long Short-Term Memory Recurrent Neural Networks;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Dimensional emotion recognition from physiological signals is a highly challenging task. Common methods rely on hand-crafted features that do not yet provide the performance necessary for real-life application. In this work, we exploit a series of convolutional and recurrent neural networks to predict affect from physiological signals, such as electrocardiogram and electrodermal activity, directly from the raw time representation. The motivation behind this so-called end-to-end approach is that, ultimately, the network learns an intermediate representation of the physiological signals that better suits the task at hand. Experimental evaluations show that, this very first study on end-to-end learning of emotion based on physiology, yields significantly better performance in comparison to existing work on the challenging RECOLA database, which includes fully spontaneous affective behaviors displayed during naturalistic interactions. Furthermore, we gain better understanding of the models' inner representations, by demonstrating that some cells' activations in the convolutional network are correlated to a large extent with hand-crafted features.
引用
收藏
页码:985 / 990
页数:6
相关论文
共 50 条
  • [41] An End-to-End Deep Learning Framework for Wideband Signal Recognition
    Vagollari, Adela
    Hirschbeck, Martin
    Gerstacker, Wolfgang
    IEEE ACCESS, 2023, 11 : 52899 - 52922
  • [42] End-to-end Convolutional Sequence Learning for ASL Fingerspelling Recognition
    Papadimitriou, Katerina
    Potamianos, Gerasimos
    INTERSPEECH 2019, 2019, : 2315 - 2319
  • [43] Arabic speech recognition using end-to-end deep learning
    Alsayadi, Hamzah A.
    Abdelhamid, Abdelaziz A.
    Hegazy, Islam
    Fayed, Zaki T.
    IET SIGNAL PROCESSING, 2021, 15 (08) : 521 - 534
  • [44] Investigation of Transfer Learning for End-to-End Russian Speech Recognition
    Kipyatkova, Irina
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 349 - 357
  • [45] SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition
    Fu, Li
    Li, Xiaoxiao
    Wang, Runyu
    Fan, Lu
    Zhang, Zhengchen
    Chen, Meng
    Wu, Youzheng
    He, Xiaodong
    INTERSPEECH 2022, 2022, : 1006 - 1010
  • [46] Online Continual Learning of End-to-End Speech Recognition Models
    Yang, Muqiao
    Lane, Ian
    Watanabe, Shinji
    INTERSPEECH 2022, 2022, : 2668 - 2672
  • [47] Toward End-to-End Face Recognition Through Alignment Learning
    Zhong, Yuanyi
    Chen, Jiansheng
    Huang, Bo
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1213 - 1217
  • [48] Combining Articulatory Features with End-to-End Learning in Speech Recognition
    Qu, Leyuan
    Weber, Cornelius
    Lakomkin, Egor
    Twiefel, Johannes
    Wermter, Stefan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 500 - 510
  • [49] Sleep Arousal Detection Using End-to-End Deep Learning Method Based on Multi-Physiological Signals
    Li, Haoqi
    Cao, Qineng
    Zhong, Yizhou
    Pan, Yun
    2018 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2018, 45
  • [50] END-TO-END CONTINUOUS EMOTION RECOGNITION FROM VIDEO USING 3D CONVLSTM NETWORKS
    Huang, Jian
    Li, Ya
    Tao, Jianhua
    Lian, Zheng
    Yi, Jiangyan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6837 - 6841