Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature

被引：0

作者：

Wang, Jinwei ^{[1
]}

Zhao, Ziping ^{[1
]}

Liang, Jinglian ^{[1
]}

Li, Chao ^{[1
]}

机构：

[1] Tianjin Normal Univ, Comp & Inf Engn Coll, Tianjin, Peoples R China

来源：

2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA) | 2018年

基金：

中国国家自然科学基金;

关键词：

emotion recognition; 3D convolutional network; face frontalization; spatiotemporal feature; AUDIO; SYSTEM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present the method used for the Multimodal Emotion Recognition Challenge (MEC) 2017 in the category of video-based emotion recognition. Our approach is based on two core ideas. First, to solve the problem of head-pose variations in video, we use the face frontalization approach, which is generally used in the field of face recognition, to synthesize the front view of the face in each frame through aligning the face to a 3D frontal model while preserving the facial expression information. Second, we use C3D, a deep 3-dimensional convolutional network that can model the appearance and motion of videos simultaneously, to extract spatiotemporal facial features from frontalized face sequences. We also use facial geometric features as a supplement. We tried different combinations of prediction scores output by softmax and linear SVM classifiers for different features to predict emotion. We tested our method on the Chinese Natural Audio-Visual Emotion Database (CHEAVD) 2.0. The experimental results show that our method achieves impressive results in terms of both accuracy and macro average precision, which significantly outperform the baseline.

引用

页数：6

共 50 条

[41] Joint sparse representation for video-based face recognition
Cui, Zhen
Chang, Hong
Shan, Shiguang
Ma, Bingpeng
Chen, Xilin
NEUROCOMPUTING, 2014, 135 : 306 - 312
[42] An adaptive classification system for video-based face recognition
Connolly, Jean-Francois
Granger, Eric
Sabourin, Robert
INFORMATION SCIENCES, 2012, 192 : 50 - 70
[43] Audio-Guided Video-Based Face Recognition
Tang, Xiaoou
Li, Zhifeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2009, 19 (07) : 955 - 964
[44] A manifold learning algorithm for video-based face recognition
Lu, Ke
Ding, Zhengming
Zhao, Jidong
Wu, Yue
Journal of Information and Computational Science, 2011, 8 (09): : 1695 - 1702
[45] An automatic system for unconstrained video-based face recognition
Zheng J.
Ranjan R.
Chen C.-H.
Chen J.-C.
Castillo C.D.
Chellappa R.
IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020, 2 (03): : 194 - 209
[46] Joint Space Learning for Video-based Face Recognition
Cao, Dong
He, Ran
Sun, Zhenan
Tan, Tieniu
PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 16 - 20
[47] Consistent Sparse Representation for Video-Based Face Recognition
Liu, Xiuping
Shen, Aihong
Zhang, Jie
Cao, Junjie
Zhou, Yanfang
COMPUTER VISION - ACCV 2016, PT III, 2017, 10113 : 404 - 418
[48] An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
Dutt, Micheal
Goodwin, Morten
Omlin, Christian W.
IEEE ACCESS, 2024, 12 : 187947 - 187963
[49] Video-Based Emotion Estimation Using Deep Neural Networks: A Comparative Study
Alchieri, Leonardo
Celona, Luigi
Bianco, Simone
IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT I, 2024, 14365 : 255 - 269
[50] Video-Based Human Activity Recognition Using Deep Learning Approaches
Surek, Guilherme Augusto Silva
Seman, Laio Oriel
Stefenon, Stefano Frizzo
Mariani, Viviana Cocco
Coelho, Leandro dos Santos
SENSORS, 2023, 23 (14)

← 1 2 3 4 5 →