Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature

被引：0

作者：

Wang, Jinwei ^{[1
]}

Zhao, Ziping ^{[1
]}

Liang, Jinglian ^{[1
]}

Li, Chao ^{[1
]}

机构：

[1] Tianjin Normal Univ, Comp & Inf Engn Coll, Tianjin, Peoples R China

来源：

2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA) | 2018年

基金：

中国国家自然科学基金;

关键词：

emotion recognition; 3D convolutional network; face frontalization; spatiotemporal feature; AUDIO; SYSTEM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present the method used for the Multimodal Emotion Recognition Challenge (MEC) 2017 in the category of video-based emotion recognition. Our approach is based on two core ideas. First, to solve the problem of head-pose variations in video, we use the face frontalization approach, which is generally used in the field of face recognition, to synthesize the front view of the face in each frame through aligning the face to a 3D frontal model while preserving the facial expression information. Second, we use C3D, a deep 3-dimensional convolutional network that can model the appearance and motion of videos simultaneously, to extract spatiotemporal facial features from frontalized face sequences. We also use facial geometric features as a supplement. We tried different combinations of prediction scores output by softmax and linear SVM classifiers for different features to predict emotion. We tested our method on the Chinese Natural Audio-Visual Emotion Database (CHEAVD) 2.0. The experimental results show that our method achieves impressive results in terms of both accuracy and macro average precision, which significantly outperform the baseline.

引用

页数：6

共 50 条

[1] Video-based driver emotion recognition using hybrid deep spatio-temporal feature learning
Varma, Harshit
Ganapathy, Nagarajan
Deserno, Thomas M.
MEDICAL IMAGING 2022: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2022, 12037
[2] Complex Wavelet Feature Extraction for Video-based Face Recognition
Zhang, Ping
IEEE SOUTHEASTCON 2010: ENERGIZING OUR FUTURE, 2010, : 440 - 443
[3] Feature Subspace Determination in Video-based Mismatched Face Recognition
Choi, Jae Young
Ro, Yong Man
Plataniotis, Konstantinos N.
2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 158 - +
[4] Video-based face recognition based on deep convolutional neural network
Zhai, Yilong
He, Dongzhi
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 23 - 27
[5] Video-based emotion recognition in the wild using deep transfer learning and score fusion
Kaya, Heysem
Gurpinar, Furkan
Salah, Albert Ali
IMAGE AND VISION COMPUTING, 2017, 65 : 66 - 75
[6] Video-based face recognition using tensor and clustering
Zhao, Jidong, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
[7] Video-based face recognition using relevance feedback
Lu, Ke
Ding, Zhengming
Zhao, Jidong
Wu, Yue
Journal of Information and Computational Science, 2010, 7 (14): : 3217 - 3224
[8] Video-based Face Recognition Using the POEM Descriptor
Nasiri, Saeid
Ghahnavieh, Amir Ebrahimi
Raie, Abolghasem A.
2014 22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2014, : 1125 - 1129
[9] Video-based framework for face recognition in video
Gorodnichy, DO
2ND CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, PROCEEDINGS, 2005, : 330 - 338
[10] Video-based face outline recognition
Dong, Xingbo
Yang, Jiewen
Teoh, Andrew Beng Jin
Yu, Dahai
Li, Xiaomeng
Jin, Zhe
PATTERN RECOGNITION, 2024, 152

← 1 2 3 4 5 →