Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature

被引：0

作者：

Wang, Jinwei ^{[1
]}

Zhao, Ziping ^{[1
]}

Liang, Jinglian ^{[1
]}

Li, Chao ^{[1
]}

机构：

[1] Tianjin Normal Univ, Comp & Inf Engn Coll, Tianjin, Peoples R China

来源：

2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA) | 2018年

基金：

中国国家自然科学基金;

关键词：

emotion recognition; 3D convolutional network; face frontalization; spatiotemporal feature; AUDIO; SYSTEM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present the method used for the Multimodal Emotion Recognition Challenge (MEC) 2017 in the category of video-based emotion recognition. Our approach is based on two core ideas. First, to solve the problem of head-pose variations in video, we use the face frontalization approach, which is generally used in the field of face recognition, to synthesize the front view of the face in each frame through aligning the face to a 3D frontal model while preserving the facial expression information. Second, we use C3D, a deep 3-dimensional convolutional network that can model the appearance and motion of videos simultaneously, to extract spatiotemporal facial features from frontalized face sequences. We also use facial geometric features as a supplement. We tried different combinations of prediction scores output by softmax and linear SVM classifiers for different features to predict emotion. We tested our method on the Chinese Natural Audio-Visual Emotion Database (CHEAVD) 2.0. The experimental results show that our method achieves impressive results in terms of both accuracy and macro average precision, which significantly outperform the baseline.

引用

页数：6

共 50 条

[21] Sparse Representation for Video-Based Face Recognition
Naseem, Imran
Togneri, Roberto
Bennamoun, Mohammed
ADVANCES IN BIOMETRICS, 2009, 5558 : 219 - +
[22] Historical Blurry Video-Based Face Recognition
Zhai, Lujun
Cui, Suxia
Wang, Yonghui
Wang, Song
Zhou, Jun
Wilsbacher, Greg
JOURNAL OF IMAGING, 2024, 10 (09)
[23] Multiple Instance Learning with Deep Instance Selection for Video-based Face Recognition
Liu, Ning
PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MECHATRONICS AND INFORMATION TECHNOLOGY (ICMIT), 2016, 49 : 327 - 332
[24] Dynamic Facial Emotion Recognition Using Deep Spatial Feature and Handcrafted Spatiotemporal Feature On Spark
Uddin, Md Azher
Lee, Young-Koo
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 21 - 27
[25] Video Emotion Recognition with Transferred Deep Feature Encodings
Xu, Baohan
Fu, Yanwei
Jiang, Yu-Gang
Li, Boyang
Sigal, Leonid
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 15 - 22
[26] Video-based face recognition using adaptive hidden Markov models
Liu, XM
Chen, TH
2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 340 - 345
[27] Video-based face recognition using earth mover's distance
Li, JW
Wang, YH
Tan, TN
AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 229 - 238
[28] Video-based face recognition using a metric of average Euclidean distance
Li, JW
Wang, YH
Tan, TN
ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 224 - 232
[29] Video-Based Face Recognition and Face-Tracking using Sparse Representation Based Categorization
Nagendra, Shruthi
Baskaran, R.
Abirami, S.
ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 746 - 755
[30] Video-Based Face Recognition Using Ensemble of Haar-Like Deep Convolutional Neural Networks
Parchami, Mostafa
Bashbaghi, Saman
Granger, Eric
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4625 - 4632

← 1 2 3 4 5 →