END-TO-END LEARNING FOR DIMENSIONAL EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS

Authors
Keren, Gil [1]
Kirschstein, Tobias [1]
Marchi, Erik [1,2]
Ringeval, Fabien [1,3]
Schuller, Bjoern [1,4]
Affiliations
[1] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany
[2] Apple Inc, Cupertino, CA 95014 USA
[3] Univ Grenoble Alpes, Lab Informat Grenoble, Grenoble, France
[4] Imperial Coll London, Dept Comp, London, England
Keywords
End-to-end learning; Physiological signals; Emotion recognition; Convolutional Neural Networks; Long Short-Term Memory Recurrent Neural Networks;
DOI
Not available
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
Dimensional emotion recognition from physiological signals is a highly challenging task. Common methods rely on hand-crafted features that do not yet provide the performance necessary for real-life applications. In this work, we exploit a series of convolutional and recurrent neural networks to predict affect from physiological signals, such as electrocardiogram and electrodermal activity, directly from the raw time representation. The motivation behind this so-called end-to-end approach is that, ultimately, the network learns an intermediate representation of the physiological signals that better suits the task at hand. Experimental evaluations show that this very first study on end-to-end learning of emotion based on physiology yields significantly better performance than existing work on the challenging RECOLA database, which includes fully spontaneous affective behaviors displayed during naturalistic interactions. Furthermore, we gain a better understanding of the models' inner representations by demonstrating that the activations of some cells in the convolutional network are correlated to a large extent with hand-crafted features.
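The end-to-end idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' architecture: the convolution kernel is set by hand rather than learned, a plain tanh recurrence stands in for the LSTM, the signal is synthetic, and all names (`conv1d`, `simple_rnn`, `pearson`) and weights are hypothetical. The sketch only shows the data flow (raw signal, convolutional activations, recurrent per-step prediction) and how a cell's activations might be compared with a hand-crafted feature via Pearson correlation.

```python
import math
import random


def conv1d(signal, kernel, stride=1):
    """Valid 1-D cross-correlation followed by ReLU (one 'cell')."""
    k = len(kernel)
    out = []
    for start in range(0, len(signal) - k + 1, stride):
        s = sum(signal[start + i] * kernel[i] for i in range(k))
        out.append(max(0.0, s))
    return out


def simple_rnn(inputs, w_in, w_rec, w_out):
    """Scalar tanh recurrence (an assumed stand-in for an LSTM layer).

    Emits one prediction per time step, as in continuous
    (dimensional) affect prediction.
    """
    h = 0.0
    preds = []
    for x in inputs:
        h = math.tanh(w_in * x + w_rec * h)
        preds.append(w_out * h)
    return preds


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


# Synthetic, strictly positive "raw physiological signal" (assumption).
random.seed(0)
signal = [1.0 + 0.5 * math.sin(0.1 * t) + 0.1 * random.random()
          for t in range(200)]

# Hand-set averaging kernel; in a trained network this would be learned.
activations = conv1d(signal, [0.25] * 4)

# Hand-crafted feature for comparison: a 4-sample moving average.
hand_crafted = [sum(signal[t:t + 4]) / 4 for t in range(len(signal) - 3)]

# Per-step predictions from the recurrent stage (arbitrary weights).
preds = simple_rnn(activations, w_in=0.5, w_rec=0.3, w_out=1.0)

print("correlation with hand-crafted feature:",
      pearson(activations, hand_crafted))
```

Because the kernel here is chosen to match the moving average, the correlation is high by construction; in the setting the abstract describes, such correlations would instead be measured on the cells of a trained network.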
Pages: 985-990
Number of pages: 6