Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature

被引:0
|
作者
Wang, Jinwei [1 ]
Zhao, Ziping [1 ]
Liang, Jinglian [1 ]
Li, Chao [1 ]
机构
[1] Tianjin Normal Univ, Comp & Inf Engn Coll, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
emotion recognition; 3D convolutional network; face frontalization; spatiotemporal feature; AUDIO; SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the method used for the Multimodal Emotion Recognition Challenge (MEC) 2017 in the category of video-based emotion recognition. Our approach is based on two core ideas. First, to solve the problem of head-pose variations in video, we use the face frontalization approach, which is generally used in the field of face recognition, to synthesize the front view of the face in each frame through aligning the face to a 3D frontal model while preserving the facial expression information. Second, we use C3D, a deep 3-dimensional convolutional network that can model the appearance and motion of videos simultaneously, to extract spatiotemporal facial features from frontalized face sequences. We also use facial geometric features as a supplement. We tried different combinations of prediction scores output by softmax and linear SVM classifiers for different features to predict emotion. We tested our method on the Chinese Natural Audio-Visual Emotion Database (CHEAVD) 2.0. The experimental results show that our method achieves impressive results in terms of both accuracy and macro average precision, which significantly outperform the baseline.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Video-based driver emotion recognition using hybrid deep spatio-temporal feature learning
    Varma, Harshit
    Ganapathy, Nagarajan
    Deserno, Thomas M.
    MEDICAL IMAGING 2022: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2022, 12037
  • [2] Complex Wavelet Feature Extraction for Video-based Face Recognition
    Zhang, Ping
    IEEE SOUTHEASTCON 2010: ENERGIZING OUR FUTURE, 2010, : 440 - 443
  • [3] Feature Subspace Determination in Video-based Mismatched Face Recognition
    Choi, Jae Young
    Ro, Yong Man
    Plataniotis, Konstantinos N.
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 158 - +
  • [4] Video-based face recognition based on deep convolutional neural network
    Zhai, Yilong
    He, Dongzhi
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 23 - 27
  • [5] Video-based emotion recognition in the wild using deep transfer learning and score fusion
    Kaya, Heysem
    Gurpinar, Furkan
    Salah, Albert Ali
    IMAGE AND VISION COMPUTING, 2017, 65 : 66 - 75
  • [6] Video-based face recognition using tensor and clustering
    Zhao, Jidong, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
  • [7] Video-based face recognition using relevance feedback
    Lu, Ke
    Ding, Zhengming
    Zhao, Jidong
    Wu, Yue
    Journal of Information and Computational Science, 2010, 7 (14): : 3217 - 3224
  • [8] Video-based Face Recognition Using the POEM Descriptor
    Nasiri, Saeid
    Ghahnavieh, Amir Ebrahimi
    Raie, Abolghasem A.
    2014 22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2014, : 1125 - 1129
  • [9] Video-based framework for face recognition in video
    Gorodnichy, DO
    2ND CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, PROCEEDINGS, 2005, : 330 - 338
  • [10] Video-based face outline recognition
    Dong, Xingbo
    Yang, Jiewen
    Teoh, Andrew Beng Jin
    Yu, Dahai
    Li, Xiaomeng
    Jin, Zhe
    PATTERN RECOGNITION, 2024, 152