Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature

被引:0
|
作者
Wang, Jinwei [1 ]
Zhao, Ziping [1 ]
Liang, Jinglian [1 ]
Li, Chao [1 ]
机构
[1] Tianjin Normal Univ, Comp & Inf Engn Coll, Tianjin, Peoples R China
来源
2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA) | 2018年
基金
中国国家自然科学基金;
关键词
emotion recognition; 3D convolutional network; face frontalization; spatiotemporal feature; AUDIO; SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the method used for the Multimodal Emotion Recognition Challenge (MEC) 2017 in the category of video-based emotion recognition. Our approach is based on two core ideas. First, to solve the problem of head-pose variations in video, we use the face frontalization approach, which is generally used in the field of face recognition, to synthesize the front view of the face in each frame through aligning the face to a 3D frontal model while preserving the facial expression information. Second, we use C3D, a deep 3-dimensional convolutional network that can model the appearance and motion of videos simultaneously, to extract spatiotemporal facial features from frontalized face sequences. We also use facial geometric features as a supplement. We tried different combinations of prediction scores output by softmax and linear SVM classifiers for different features to predict emotion. We tested our method on the Chinese Natural Audio-Visual Emotion Database (CHEAVD) 2.0. The experimental results show that our method achieves impressive results in terms of both accuracy and macro average precision, which significantly outperform the baseline.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Joint sparse representation for video-based face recognition
    Cui, Zhen
    Chang, Hong
    Shan, Shiguang
    Ma, Bingpeng
    Chen, Xilin
    NEUROCOMPUTING, 2014, 135 : 306 - 312
  • [42] An adaptive classification system for video-based face recognition
    Connolly, Jean-Francois
    Granger, Eric
    Sabourin, Robert
    INFORMATION SCIENCES, 2012, 192 : 50 - 70
  • [43] Audio-Guided Video-Based Face Recognition
    Tang, Xiaoou
    Li, Zhifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2009, 19 (07) : 955 - 964
  • [44] A manifold learning algorithm for video-based face recognition
    Lu, Ke
    Ding, Zhengming
    Zhao, Jidong
    Wu, Yue
    Journal of Information and Computational Science, 2011, 8 (09): : 1695 - 1702
  • [45] An automatic system for unconstrained video-based face recognition
    Zheng J.
    Ranjan R.
    Chen C.-H.
    Chen J.-C.
    Castillo C.D.
    Chellappa R.
    IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020, 2 (03): : 194 - 209
  • [46] Joint Space Learning for Video-based Face Recognition
    Cao, Dong
    He, Ran
    Sun, Zhenan
    Tan, Tieniu
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 16 - 20
  • [47] Consistent Sparse Representation for Video-Based Face Recognition
    Liu, Xiuping
    Shen, Aihong
    Zhang, Jie
    Cao, Junjie
    Zhou, Yanfang
    COMPUTER VISION - ACCV 2016, PT III, 2017, 10113 : 404 - 418
  • [48] An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
    Dutt, Micheal
    Goodwin, Morten
    Omlin, Christian W.
    IEEE ACCESS, 2024, 12 : 187947 - 187963
  • [49] Video-Based Emotion Estimation Using Deep Neural Networks: A Comparative Study
    Alchieri, Leonardo
    Celona, Luigi
    Bianco, Simone
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT I, 2024, 14365 : 255 - 269
  • [50] Video-Based Human Activity Recognition Using Deep Learning Approaches
    Surek, Guilherme Augusto Silva
    Seman, Laio Oriel
    Stefenon, Stefano Frizzo
    Mariani, Viviana Cocco
    Coelho, Leandro dos Santos
    SENSORS, 2023, 23 (14)