Deep Learning Based Video Spatio-Temporal Modeling for Emotion Recognition

Cited by: 5

Authors:

Fonnegra, Ruben D. [1]

Diaz, Gloria M. [1]

Affiliations:

[1] Inst Tecnol Metropolitano, Medellin, Colombia

Source:

HUMAN-COMPUTER INTERACTION: THEORIES, METHODS, AND HUMAN ISSUES, HCI INTERNATIONAL 2018, PT I | 2018 / Vol. 10901

Keywords:

Deep learning; Facial emotion recognition; Spatio-temporal modeling; FACIAL EXPRESSION RECOGNITION; DESIGN; SYSTEM;

DOI:

10.1007/978-3-319-91238-7_32

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

Affective computing is a growing research area that aims to determine users' emotional states from their conscious and unconscious actions and to use those states to adapt machine interaction. This paper investigates the discriminative ability of convolutional and recurrent neural networks to model spatio-temporal features from video sequences of the face region. In a deep learning architecture, dense convolutional layers analyze changes in spatial information across frames over short time periods, while dense recurrent layers model those frame changes as temporal sequences that evolve over time. These layers are then connected to a multilayer perceptron (MLP) that performs the classification task, which consists of distinguishing between six emotion categories. Performance was evaluated in two settings: gender-independent and gender-dependent classification. Experimental results show that the proposed approach achieves an accuracy of 81.84% in the gender-independent experiment, which outperforms previous works using the same experimental data. In the gender-dependent experiment, accuracy was 80.79% for male subjects and 82.75% for female subjects.
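
The pipeline outlined in the abstract (per-frame convolutional features, a recurrent pass over the frame sequence, then an MLP over six emotion classes) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no layer sizes, recurrent cell type, or training details, so the kernel count, hidden sizes, plain Elman recurrence, global average pooling, and every function name below are assumptions chosen for illustration, and the weights are random and untrained.

```python
import numpy as np

NUM_EMOTIONS = 6  # six emotion categories, as stated in the abstract


def conv2d_valid(frame, kernels):
    """Valid 2-D cross-correlation of one grayscale frame with a kernel bank.

    frame: (H, W); kernels: (C, k, k); returns (C, H-k+1, W-k+1).
    """
    k = kernels.shape[-1]
    windows = np.lib.stride_tricks.sliding_window_view(frame, (k, k))
    return np.einsum("hwij,cij->chw", windows, kernels)


def spatial_features(frames, kernels):
    """Per-frame conv -> ReLU -> global average pooling; returns (T, C)."""
    feats = []
    for frame in frames:
        maps = np.maximum(conv2d_valid(frame, kernels), 0.0)
        feats.append(maps.mean(axis=(1, 2)))
    return np.stack(feats)


def recurrent_summary(feats, w_in, w_rec, b_rec):
    """Plain Elman recurrence over time (assumed cell type); returns last state."""
    hidden = np.zeros(w_rec.shape[0])
    for x_t in feats:
        hidden = np.tanh(w_in @ x_t + w_rec @ hidden + b_rec)
    return hidden


def mlp_classify(hidden, w1, b1, w2, b2):
    """One hidden ReLU layer followed by a softmax over the emotion classes."""
    z = np.maximum(w1 @ hidden + b1, 0.0)
    logits = w2 @ z + b2
    exp = np.exp(logits - logits.max())  # shift for numerical stability
    return exp / exp.sum()


def init_params(rng, channels=4, ksize=3, hidden=8, mlp_units=16):
    """Random, untrained weights; all sizes are illustrative assumptions."""
    scale = 0.1
    return {
        "kernels": scale * rng.standard_normal((channels, ksize, ksize)),
        "w_in": scale * rng.standard_normal((hidden, channels)),
        "w_rec": scale * rng.standard_normal((hidden, hidden)),
        "b_rec": np.zeros(hidden),
        "w1": scale * rng.standard_normal((mlp_units, hidden)),
        "b1": np.zeros(mlp_units),
        "w2": scale * rng.standard_normal((NUM_EMOTIONS, mlp_units)),
        "b2": np.zeros(NUM_EMOTIONS),
    }


def predict(frames, p):
    """Forward pass: spatial features -> temporal summary -> class probabilities."""
    feats = spatial_features(frames, p["kernels"])
    hidden = recurrent_summary(feats, p["w_in"], p["w_rec"], p["b_rec"])
    return mlp_classify(hidden, p["w1"], p["b1"], p["w2"], p["b2"])


rng = np.random.default_rng(0)
clip = rng.random((10, 32, 32))  # 10 synthetic grayscale face-region frames
probs = predict(clip, init_params(rng))
print(probs.shape, probs.sum())
```

Because the weights are untrained, the output probabilities carry no meaning; the sketch only shows how a clip of shape (frames, height, width) flows through the three stages to a six-way distribution.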


Pages: 397-408

Page count: 12

50 entries in total

[31] Spatio-Temporal Image-Based Encoded Atlases for EEG Emotion Recognition
Avola, Danilo
Cinque, Luigi
Mambro, Angelo Di
Fagioli, Alessio
Marini, Marco Raoul
Pannone, Daniele
Fanini, Bruno
Foresti, Gian Luca
[J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2024, 34 (05)
[32] Spatio-temporal Video Autoencoder for Human Action Recognition
Sousa e Santos, Anderson Carlos
Pedrini, Helio
[J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019 : 114 - 123
[33] Spatio-temporal detection of auroral substorm based on deep learning
Yang QiuJu
Ren Jie
Xiang Han
[J]. CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2022, 65 (03) : 898 - 907
[34] Interpretable Spatio-temporal Attention for Video Action Recognition
Meng, Lili
Zhao, Bo
Chang, Bo
Huang, Gao
Sun, Wei
Tung, Frederich
Sigal, Leonid
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019 : 1513 - 1522
[35] Exploiting spatio-temporal knowledge for video action recognition
Zhang, Huigang
Wang, Liuan
Sun, Jun
[J]. IET COMPUTER VISION, 2023, 17 (02) : 222 - 230
[36] Kronecker PCA Based Spatio-Temporal Modeling of Video for Dismount Classification
Greenewald, Kristjan H.
Hero, Alfred O., III
[J]. ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXI, 2014, 9093
[37] LEARNING SPATIO-TEMPORAL DEPENDENCIES FOR ACTION RECOGNITION
Cai, Qiao
Yin, Yafeng
Man, Hong
[J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013 : 3740 - 3744
[38] Spatio-Temporal Context Modeling for BoW-Based Video Classification
Yi, Saehoon
Pavlovic, Vladimir
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013 : 779 - 786
[39] Video summarization via spatio-temporal deep architecture
Zhong, Sheng-hua
Wu, Jiaxin
Jiang, Jianmin
[J]. NEUROCOMPUTING, 2019, 332 : 224 - 235
[40] Human Action Recognition by Learning Spatio-Temporal Features With Deep Neural Networks
Wang, Lei
Xu, Yangyang
Cheng, Jun
Xia, Haiying
Yin, Jianqin
Wu, Jiaji
[J]. IEEE ACCESS, 2018, 6 : 17913 - 17922
