Deep Learning Based Video Spatio-Temporal Modeling for Emotion Recognition

被引:5
|
作者
Fonnegra, Ruben D. [1 ]
Diaz, Gloria M. [1 ]
机构
[1] Inst Tecnol Metropolitano, Medellin, Colombia
关键词
Deep learning; Facial emotion recognition; Spatio-temporal modeling; FACIAL EXPRESSION RECOGNITION; DESIGN; SYSTEM;
D O I
10.1007/978-3-319-91238-7_32
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Affective Computing is a growing research area, which aims to determine the emotional user states through their conscious and unconscious actions and use it to modify the machine interaction. This paper investigates the discriminative abilities of convolutional and recurrent neural networks to modeling spatio-temporal features from video sequences of the face region. In a deep learning architecture, dense convolutional layers are used for analyzing spatial information changes in frames during short time periods, while dense recurrent layers are used to model changes in frames as temporal sequences that change across the time. Those layers are then connected to a multilayer perceptron (MLP) to perform the classification task, which consists in to distinguish between six different emotion categories. The performance was twofold evaluated: gender independent and gender-dependent classifications. Experimental results show that the proposed approach achieves an accuracy of 81.84%, in the gender independent experiment, which outperforms previous works using the same experimental data. In the gender-dependent experiment, accuracy was 80.79% and 82.75% for male and female, respectively.
引用
收藏
页码:397 / 408
页数:12
相关论文
共 50 条
  • [31] Spatio-Temporal Image-Based Encoded Atlases for EEG Emotion Recognition
    Avola, Danilo
    Cinque, Luigi
    Mambro, Angelo Di
    Fagioli, Alessio
    Marini, Marco Raoul
    Pannone, Daniele
    Fanini, Bruno
    Foresti, Gian Luca
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2024, 34 (05)
  • [32] Spatio-temporal Video Autoencoder for Human Action Recognition
    Sousa e Santos, Anderson Carlos
    Pedrini, Helio
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 114 - 123
  • [33] Spatio-temporal detection of auroral substorm based on deep learning
    Yang QiuJu
    Ren Jie
    Xiang Han
    [J]. CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2022, 65 (03): : 898 - 907
  • [34] Interpretable Spatio-temporal Attention for Video Action Recognition
    Meng, Lili
    Zhao, Bo
    Chang, Bo
    Huang, Gao
    Sun, Wei
    Tung, Frederich
    Sigal, Leonid
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1513 - 1522
  • [35] Exploiting spatio-temporal knowledge for video action recognition
    Zhang, Huigang
    Wang, Liuan
    Sun, Jun
    [J]. IET COMPUTER VISION, 2023, 17 (02) : 222 - 230
  • [36] Kronecker PCA Based Spatio-Temporal Modeling of Video for Dismount Classification
    Greenewald, Kristjan H.
    Hero, Alfred O., III
    [J]. ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXI, 2014, 9093
  • [37] LEARNING SPATIO-TEMPORAL DEPENDENCIES FOR ACTION RECOGNITION
    Cai, Qiao
    Yin, Yafeng
    Man, Hong
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3740 - 3744
  • [38] Spatio-Temporal Context Modeling for BoW-Based Video Classification
    Yi, Saehoon
    Pavlovic, Vladimir
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 779 - 786
  • [39] Video summarization via spatio-temporal deep architecture
    Zhong, Sheng-hua
    Wu, Jiaxin
    Jiang, Jianmin
    [J]. NEUROCOMPUTING, 2019, 332 : 224 - 235
  • [40] Human Action Recognition by Learning Spatio-Temporal Features With Deep Neural Networks
    Wang, Lei
    Xu, Yangyang
    Cheng, Jun
    Xia, Haiying
    Yin, Jianqin
    Wu, Jiaji
    [J]. IEEE ACCESS, 2018, 6 : 17913 - 17922