Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature

被引:0
|
作者
Wang, Jinwei [1 ]
Zhao, Ziping [1 ]
Liang, Jinglian [1 ]
Li, Chao [1 ]
机构
[1] Tianjin Normal Univ, Comp & Inf Engn Coll, Tianjin, Peoples R China
来源
2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA) | 2018年
基金
中国国家自然科学基金;
关键词
emotion recognition; 3D convolutional network; face frontalization; spatiotemporal feature; AUDIO; SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the method used for the Multimodal Emotion Recognition Challenge (MEC) 2017 in the category of video-based emotion recognition. Our approach is based on two core ideas. First, to solve the problem of head-pose variations in video, we use the face frontalization approach, which is generally used in the field of face recognition, to synthesize the front view of the face in each frame through aligning the face to a 3D frontal model while preserving the facial expression information. Second, we use C3D, a deep 3-dimensional convolutional network that can model the appearance and motion of videos simultaneously, to extract spatiotemporal facial features from frontalized face sequences. We also use facial geometric features as a supplement. We tried different combinations of prediction scores output by softmax and linear SVM classifiers for different features to predict emotion. We tested our method on the Chinese Natural Audio-Visual Emotion Database (CHEAVD) 2.0. The experimental results show that our method achieves impressive results in terms of both accuracy and macro average precision, which significantly outperform the baseline.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Sparse Representation for Video-Based Face Recognition
    Naseem, Imran
    Togneri, Roberto
    Bennamoun, Mohammed
    ADVANCES IN BIOMETRICS, 2009, 5558 : 219 - +
  • [22] Historical Blurry Video-Based Face Recognition
    Zhai, Lujun
    Cui, Suxia
    Wang, Yonghui
    Wang, Song
    Zhou, Jun
    Wilsbacher, Greg
    JOURNAL OF IMAGING, 2024, 10 (09)
  • [23] Multiple Instance Learning with Deep Instance Selection for Video-based Face Recognition
    Liu, Ning
    PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MECHATRONICS AND INFORMATION TECHNOLOGY (ICMIT), 2016, 49 : 327 - 332
  • [24] Dynamic Facial Emotion Recognition Using Deep Spatial Feature and Handcrafted Spatiotemporal Feature On Spark
    Uddin, Md Azher
    Lee, Young-Koo
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 21 - 27
  • [25] Video Emotion Recognition with Transferred Deep Feature Encodings
    Xu, Baohan
    Fu, Yanwei
    Jiang, Yu-Gang
    Li, Boyang
    Sigal, Leonid
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 15 - 22
  • [26] Video-based face recognition using adaptive hidden Markov models
    Liu, XM
    Chen, TH
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 340 - 345
  • [27] Video-based face recognition using earth mover's distance
    Li, JW
    Wang, YH
    Tan, TN
    AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 229 - 238
  • [28] Video-based face recognition using a metric of average Euclidean distance
    Li, JW
    Wang, YH
    Tan, TN
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 224 - 232
  • [29] Video-Based Face Recognition and Face-Tracking using Sparse Representation Based Categorization
    Nagendra, Shruthi
    Baskaran, R.
    Abirami, S.
    ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 746 - 755
  • [30] Video-Based Face Recognition Using Ensemble of Haar-Like Deep Convolutional Neural Networks
    Parchami, Mostafa
    Bashbaghi, Saman
    Granger, Eric
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4625 - 4632