Video-based Emotion Recognition Using Deeply-Supervised Neural Networks

被引:47
|
作者
Fan, Yingruo [1 ]
Lam, Jacqueline C. K. [1 ]
Li, Victor O. K. [1 ]
机构
[1] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Peoples R China
关键词
Emotion Recognition; Deeply-Supervised; Side-output Layers; Convolutional Neural Network; EmotiW; 2018; Challenge;
D O I
10.1145/3242969.3264978
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Emotion recognition (ER) based on natural facial images/videos has been studied for some years and considered a comparatively hot topic in the field of affective computing. However, it remains a challenge to perform ER in the wild, given the noises generated from head pose, face deformation, and illumination variation. To address this challenge, motivated by recent progress in Convolutional Neural Network (CNN), we develop a novel deeply supervised CNN (DSN) architecture, taking the multi-level and multi scale features extracted from different convolutional layers to provide a more advanced representation of ER. By embedding a series of side-output layers, our DSN model provides class-wise supervision and integrates predictions from multiple layers. Finally, our team ranked 3rd at the EmotiW 2018 challenge with our model achieving an accuracy of 61.1%.
引用
收藏
页码:584 / 588
页数:5
相关论文
共 50 条
  • [31] Adaptive metric learning with deep neural networks for video-based facial expression recognition
    Liu, Xiaofeng
    Ge, Yubin
    Yang, Chao
    Jia, Ping
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (01)
  • [32] Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition
    Ding, Changxing
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 1002 - 1014
  • [33] Real-time human action recognition using raw depth video-based recurrent neural networks
    Sanchez-Caballero, Adrian
    Fuentes-Jimenez, David
    Losada-Gutierrez, Cristina
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16213 - 16235
  • [34] Video-Based Chinese Sign Language Recognition Using Convolutional Neural Network
    Yang, Su
    Zhu, Qing
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 929 - 934
  • [35] CONVOLUTIONAL NEURAL TREE FOR VIDEO-BASED FACIAL EXPRESSION RECOGNITION EMBEDDING EMOTION WHEEL AS INDUCTIVE BIAS
    Miyoshi, Ryo
    Akizuki, Shuichi
    Tobitani, Kensuke
    Nagata, Noriko
    Hashimoto, Manabu
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3261 - 3265
  • [36] Real-time human action recognition using raw depth video-based recurrent neural networks
    Adrián Sánchez-Caballero
    David Fuentes-Jiménez
    Cristina Losada-Gutiérrez
    Multimedia Tools and Applications, 2023, 82 : 16213 - 16235
  • [37] Multi-Attention Fusion Network for Video-based Emotion Recognition
    Wang, Yanan
    Wu, Jianming
    Hoashi, Keiichiro
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 595 - 601
  • [38] A Video-Based Cognitive Emotion Recognition Method Using an Active Learning Algorithm Based on Complexity and Uncertainty
    Wu, Hongduo
    Zhou, Dong
    Guo, Ziyue
    Song, Zicheng
    Li, Yu
    Wei, Xingzheng
    Zhou, Qidi
    APPLIED SCIENCES-BASEL, 2025, 15 (01):
  • [39] Emotion recognition in speech using neural networks
    Nicholson, J
    Takahashi, K
    Nakatsu, R
    AFFECTIVE MINDS, 2000, : 215 - 220
  • [40] Emotion recognition in speech using neural networks
    Nicholson, J
    Takahashi, K
    Nakatsu, R
    NEURAL COMPUTING & APPLICATIONS, 2000, 9 (04): : 290 - 296