END-TO-END LEARNING FOR DIMENSIONAL EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS

Cited by: 0

Authors:

Keren, Gil [1]

Kirschstein, Tobias [1]

Marchi, Erik [1,2]

Ringeval, Fabien [1,3]

Schuller, Bjoern [1,4]

Affiliations:

[1] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany

[2] Apple Inc, Cupertino, CA 95014 USA

[3] Univ Grenoble Alpes, Lab Informat Grenoble, Grenoble, France

[4] Imperial Coll London, Dept Comp, London, England

Source:

2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2017

Keywords:

End-to-end learning; Physiological signals; Emotion recognition; Convolutional Neural Networks; Long Short-Term Memory Recurrent Neural Networks;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Dimensional emotion recognition from physiological signals is a highly challenging task. Common methods rely on hand-crafted features that do not yet provide the performance necessary for real-life applications. In this work, we exploit a series of convolutional and recurrent neural networks to predict affect from physiological signals, such as electrocardiogram and electrodermal activity, directly from the raw time representation. The motivation behind this so-called end-to-end approach is that, ultimately, the network learns an intermediate representation of the physiological signals that better suits the task at hand. Experimental evaluations show that this first study on end-to-end learning of emotion based on physiology yields significantly better performance than existing work on the challenging RECOLA database, which includes fully spontaneous affective behaviors displayed during naturalistic interactions. Furthermore, we gain a better understanding of the models' inner representations by demonstrating that some cells' activations in the convolutional network are correlated to a large extent with hand-crafted features.
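The abstract's last claim, that convolutional activations can correlate with hand-crafted features, can be illustrated with a toy sketch. This is not the authors' model or data: the signal is synthetic, the kernel weights are hypothetical stand-ins for learned ones, and the windowed mean is just one assumed example of a hand-crafted feature.

```python
import math
import random


def conv1d_valid(signal, kernel):
    """Slide the kernel over the signal without padding, as a 1-D conv layer does."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]


def relu(values):
    """Rectified linear activation applied element-wise."""
    return [max(0.0, v) for v in values]


def windowed_mean(signal, width):
    """A simple hand-crafted feature: mean over a sliding window."""
    return [sum(signal[i:i + width]) / width
            for i in range(len(signal) - width + 1)]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


random.seed(0)
# Synthetic stand-in for a raw physiological signal (assumption, not RECOLA data).
signal = [1.0 + 0.5 * math.sin(2 * math.pi * t / 50) + random.gauss(0, 0.1)
          for t in range(500)]

# Hypothetical filter weights; in a trained network these would be learned.
kernel = [0.3, 0.2, 0.2, 0.2, 0.1]

activation = relu(conv1d_valid(signal, kernel))
feature = windowed_mean(signal, len(kernel))
r = pearson(activation, feature)
print(f"correlation between activation and hand-crafted feature: {r:.3f}")
```

In this toy setting the correlation is high because the filter is close to a moving average. On a trained network, the same comparison would be run per unit against each candidate feature.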


Pages: 985-990

Page count: 6
