Deep Learning Based Video Spatio-Temporal Modeling for Emotion Recognition

被引:5
|
作者
Fonnegra, Ruben D. [1 ]
Diaz, Gloria M. [1 ]
机构
[1] Inst Tecnol Metropolitano, Medellin, Colombia
关键词
Deep learning; Facial emotion recognition; Spatio-temporal modeling; FACIAL EXPRESSION RECOGNITION; DESIGN; SYSTEM;
D O I
10.1007/978-3-319-91238-7_32
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Affective Computing is a growing research area, which aims to determine the emotional user states through their conscious and unconscious actions and use it to modify the machine interaction. This paper investigates the discriminative abilities of convolutional and recurrent neural networks to modeling spatio-temporal features from video sequences of the face region. In a deep learning architecture, dense convolutional layers are used for analyzing spatial information changes in frames during short time periods, while dense recurrent layers are used to model changes in frames as temporal sequences that change across the time. Those layers are then connected to a multilayer perceptron (MLP) to perform the classification task, which consists in to distinguish between six different emotion categories. The performance was twofold evaluated: gender independent and gender-dependent classifications. Experimental results show that the proposed approach achieves an accuracy of 81.84%, in the gender independent experiment, which outperforms previous works using the same experimental data. In the gender-dependent experiment, accuracy was 80.79% and 82.75% for male and female, respectively.
引用
收藏
页码:397 / 408
页数:12
相关论文
共 50 条
  • [21] Spatio-Temporal Deep Residual Network with Hierarchical Attentions for Video Event Recognition
    Li, Yonggang
    Liu, Chunping
    Ji, Yi
    Gong, Shengrong
    Xu, Haibao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (02)
  • [22] Video modeling via spatio-temporal adaptive localized learning (STALL)
    Zheng, Yunfei
    Li, Xin
    [J]. 2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 979 - +
  • [23] Drought prediction in Jilin Province based on deep learning and spatio-temporal sequence modeling
    Hou, Zhaojun
    Wang, Beibei
    Zhang, Yichen
    Zhang, Jiquan
    Song, Jingyuan
    [J]. JOURNAL OF HYDROLOGY, 2024, 642
  • [24] Spatio-Temporal Encoder-Decoder Fully Convolutional Network for Video-Based Dimensional Emotion Recognition
    Du, Zhengyin
    Wu, Suowei
    Huang, Di
    Li, Weixin
    Wang, Yunhong
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (03) : 565 - 578
  • [25] Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition
    Dung Nguyen
    Kien Nguyen
    Sridharan, Sridha
    Dean, David
    Fookes, Clinton
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 174 : 33 - 42
  • [26] Video Action Recognition Based on Spatio-temporal Feature Pyramid Module
    Gong, Suming
    Chen, Ying
    [J]. 2020 13TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2020), 2020, : 338 - 341
  • [27] Video Behavior Recognition of Dairy Cows Based on Spatio-temporal Features
    Wang, Kejian
    Sun, Yifei
    Si, Yongsheng
    Han, Xianzhong
    He, Zhenxue
    [J]. Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (05): : 261 - 267
  • [28] Adversarial Spatio-Temporal Learning for Video Deblurring
    Zhang, Kaihao
    Luo, Wenhan
    Zhong, Yiran
    Ma, Lin
    Liu, Wei
    Li, Hongdong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 291 - 301
  • [29] Video Fingerprint Algorithm Based on Spatio-Temporal Deep Neural Network
    Wang Dongdong
    Li Yuenan
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (01)
  • [30] Attention-based Spatio-Temporal Graphic LSTM for EEG Emotion Recognition
    Li, Xiaoxu
    Zheng, Wenming
    Zong, Yuan
    Chang, Hongli
    Lu, Cheng
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,