Recognition of facial actions and their temporal segments based on duration models

被引：0

作者：

Isabel Gonzalez

Francesco Cartella

Valentin Enescu

Hichem Sahli

机构：

[1] Vrije Universiteit Brussel (VUB),Department Electronics and Informatics, VUB

[2] Interuniveristy Microelectronics Center (IMEC),NPU Joint AVSP Lab

来源：

Multimedia Tools and Applications | 2015年 / 74卷

关键词：

Facial action units (AUs); Hidden semi-Markov models (HSMMs); Variable duration semi-Markov model (VDHMM);

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Being able to automatically analyze finegrained changes in facial expression into action units (AUs), of the Facial Action Coding System (FACS), and their temporal models (i.e., sequences of temporal phases, neutral, onset, apex, and offset), in face videos would greatly benefit for facial expression recognition systems. Previous works, considered combining, per AU, a discriminative frame-based Support Vector Machine (SVM) and a dynamic generative Hidden Markov Models (HMM), to detect the presence of the AU in question and its temporal segments in an input image sequence. The major drawback of HMMs, is that they do not model well time dependent dynamics as the ones of AUs, especially when dealing with spontaneous expressions. To alleviate this problem, in this paper, we exploit efficient duration modeling of the temporal behavior of AUs, and we propose hidden semi-Markov model (HSMM) and variable duration semi-Markov model (VDHMM) to recognize the dynamics of AU’s. Such models allow the parameterization and inference of the AU’s state duration distributions. Within our system, geometrical and appearance based measurements, as well as their first derivatives, modeling both the dynamics and the appearance of AUs, are applied to pair-wise SVM classifiers for a frame-based classification. The output of which are then fed as evidence to the HSMM or VDHMM for inferring AUs temporal phases. A thorough investigation into the aspect of duration modeling and its application to AU recognition through extensive comparison to state-of-art SVM-HMM approaches are presented. For comparison, an average recognition rate of 64.83 % and 64.66 % is achieved for the HSMM and VDHMM respectively. Our framework has several benefits: (1) it models the AU’s temporal phases duration; (2) it does not require any assumption about the underlying structure of the AU events, and (3) compared to HMM, the proposed HSMM and VDHMM duration models reduce the duration error of the temporal phases of an AU, and they are especially better in recognizing the offset ending of an AU.

引用

页码：10001 / 10024

页数：23

共 50 条

[21] Non-rigid registration using free-form deformations for recognition of facial actions and their temporal dynamics
Koelstra, Sander
Pantic, Maja
[J]. 2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 920 - +
[22] Facial Expression Recognition Based on Combination of Spatio-temporal and Spectral Features in Local Facial Regions
Abounasr, Nakisa
Pourghassem, Hossein
[J]. 2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 446 - 450
[23] Temporal weights in the perception of sound intensity: Effects of sound duration and number of temporal segments
[J]. 1600, Acoustical Society of America (143):
[24] Temporal weights in the perception of sound intensity: Effects of sound duration and number of temporal segments
Oberfeld, Daniel
Hots, Jan
Verhey, Jesko L.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (02): : 943 - 953
[25] Word segments in category-based language models for automatic speech recognition
Justo, Raquel
Torres, M. Ines
[J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 1, PROCEEDINGS, 2007, 4477 : 249 - +
[26] Affective actions recognition in dyadic interactions based on generative and discriminative models
Yang, Ning
Wang, Zhelong
Zhao, Hongyu
Li, Jie
Qiu, Sen
[J]. SENSOR REVIEW, 2020, 40 (05) : 605 - 615
[27] Facial Expression Recognition with Temporal Modeling of Shapes
Jain, Suyog
Hu, Changbo
Aggarwal, J. K.
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
[28] A Temporal Approach to Facial Emotion Expression Recognition
Asaju, Christine
Vadapalli, Hima
[J]. ARTIFICIAL INTELLIGENCE RESEARCH, SACAIR 2021, 2022, 1551 : 274 - 286
[29] Sparse Temporal Representations for Facial Expression Recognition
Chew, S. W.
Rana, R.
Lucey, P.
Lucey, S.
Sridharan, S.
[J]. ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PT II, 2011, 7088 : 311 - +
[30] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
Zhang, Lifeng
Zheng, Xiangwei
Chen, Xuanchi
Ren, Xiuxiu
Ji, Cun
[J]. NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6109 - 6124

← 1 2 3 4 5 →