Recognition of facial actions and their temporal segments based on duration models

被引：0

作者：

Isabel Gonzalez

Francesco Cartella

Valentin Enescu

Hichem Sahli

机构：

[1] Vrije Universiteit Brussel (VUB),Department Electronics and Informatics, VUB

[2] Interuniveristy Microelectronics Center (IMEC),NPU Joint AVSP Lab

来源：

Multimedia Tools and Applications | 2015年 / 74卷

关键词：

Facial action units (AUs); Hidden semi-Markov models (HSMMs); Variable duration semi-Markov model (VDHMM);

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Being able to automatically analyze finegrained changes in facial expression into action units (AUs), of the Facial Action Coding System (FACS), and their temporal models (i.e., sequences of temporal phases, neutral, onset, apex, and offset), in face videos would greatly benefit for facial expression recognition systems. Previous works, considered combining, per AU, a discriminative frame-based Support Vector Machine (SVM) and a dynamic generative Hidden Markov Models (HMM), to detect the presence of the AU in question and its temporal segments in an input image sequence. The major drawback of HMMs, is that they do not model well time dependent dynamics as the ones of AUs, especially when dealing with spontaneous expressions. To alleviate this problem, in this paper, we exploit efficient duration modeling of the temporal behavior of AUs, and we propose hidden semi-Markov model (HSMM) and variable duration semi-Markov model (VDHMM) to recognize the dynamics of AU’s. Such models allow the parameterization and inference of the AU’s state duration distributions. Within our system, geometrical and appearance based measurements, as well as their first derivatives, modeling both the dynamics and the appearance of AUs, are applied to pair-wise SVM classifiers for a frame-based classification. The output of which are then fed as evidence to the HSMM or VDHMM for inferring AUs temporal phases. A thorough investigation into the aspect of duration modeling and its application to AU recognition through extensive comparison to state-of-art SVM-HMM approaches are presented. For comparison, an average recognition rate of 64.83 % and 64.66 % is achieved for the HSMM and VDHMM respectively. Our framework has several benefits: (1) it models the AU’s temporal phases duration; (2) it does not require any assumption about the underlying structure of the AU events, and (3) compared to HMM, the proposed HSMM and VDHMM duration models reduce the duration error of the temporal phases of an AU, and they are especially better in recognizing the offset ending of an AU.

引用

页码：10001 / 10024

页数：23

共 50 条

[1] Recognition of facial actions and their temporal segments based on duration models
Gonzalez, Isabel
Cartella, Francesco
Enescu, Valentin
Sahli, Hichem
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 10001 - 10024
[2] A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models
Koelstra, Sander
Pantic, Maja
Patras, Ioannis
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (11) : 1940 - 1954
[3] Dynamics of facial expression: Recognition of facial actions and their temporal segments from face profile image sequences
Pantic, Maja
Patras, Ioannis
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (02): : 433 - 449
[4] Fully Automatic Recognition of the Temporal Phases of Facial Actions
Valstar, Michel F.
Pantic, Maja
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (01): : 28 - 43
[5] Detecting facial actions and their temporal segments in nearly frontal-view face image sequences
Pantic, M
Patras, I
[J]. INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 3358 - 3363
[6] Facial Expression Recognition Based on Auxiliary Models
Wang, Yingying
Li, Yibin
Song, Yong
Rong, Xuewen
[J]. ALGORITHMS, 2019, 12 (11)
[7] EXEMPLAR BASED LANGUAGE RECOGNITION METHOD FOR SHORT-DURATION SPEECH SEGMENTS
Wang, Meng-Ge
Song, Yan
Jiang, Bing
Dai, Li-Rong
McLoulghlin, Ian
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7354 - 7358
[8] Privacy-preserving facial recognition based on temporal features
Leong, Shu-Min
Phan, Raphael C-W
Baskaran, Vishnu Monn
Ooi, Chee-Pun
[J]. APPLIED SOFT COMPUTING, 2020, 96
[9] Temporal based Emotion Recognition inspired by Activity Recognition models
Mohan, Balaganesh
Popa, Mirela
[J]. 2021 9TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2021,
[10] An expert system for recognition of facial actions and their intensity
Pantic, M
Rothkrantz, LJM
[J]. SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 1026 - 1033

← 1 2 3 4 5 →