Recognition of facial actions and their temporal segments based on duration models

被引:0
|
作者
Isabel Gonzalez
Francesco Cartella
Valentin Enescu
Hichem Sahli
机构
[1] Vrije Universiteit Brussel (VUB),Department Electronics and Informatics, VUB
[2] Interuniveristy Microelectronics Center (IMEC),NPU Joint AVSP Lab
来源
关键词
Facial action units (AUs); Hidden semi-Markov models (HSMMs); Variable duration semi-Markov model (VDHMM);
D O I
暂无
中图分类号
学科分类号
摘要
Being able to automatically analyze finegrained changes in facial expression into action units (AUs), of the Facial Action Coding System (FACS), and their temporal models (i.e., sequences of temporal phases, neutral, onset, apex, and offset), in face videos would greatly benefit for facial expression recognition systems. Previous works, considered combining, per AU, a discriminative frame-based Support Vector Machine (SVM) and a dynamic generative Hidden Markov Models (HMM), to detect the presence of the AU in question and its temporal segments in an input image sequence. The major drawback of HMMs, is that they do not model well time dependent dynamics as the ones of AUs, especially when dealing with spontaneous expressions. To alleviate this problem, in this paper, we exploit efficient duration modeling of the temporal behavior of AUs, and we propose hidden semi-Markov model (HSMM) and variable duration semi-Markov model (VDHMM) to recognize the dynamics of AU’s. Such models allow the parameterization and inference of the AU’s state duration distributions. Within our system, geometrical and appearance based measurements, as well as their first derivatives, modeling both the dynamics and the appearance of AUs, are applied to pair-wise SVM classifiers for a frame-based classification. The output of which are then fed as evidence to the HSMM or VDHMM for inferring AUs temporal phases. A thorough investigation into the aspect of duration modeling and its application to AU recognition through extensive comparison to state-of-art SVM-HMM approaches are presented. For comparison, an average recognition rate of 64.83 % and 64.66 % is achieved for the HSMM and VDHMM respectively. Our framework has several benefits: (1) it models the AU’s temporal phases duration; (2) it does not require any assumption about the underlying structure of the AU events, and (3) compared to HMM, the proposed HSMM and VDHMM duration models reduce the duration error of the temporal phases of an AU, and they are especially better in recognizing the offset ending of an AU.
引用
收藏
页码:10001 / 10024
页数:23
相关论文
共 50 条
  • [21] Non-rigid registration using free-form deformations for recognition of facial actions and their temporal dynamics
    Koelstra, Sander
    Pantic, Maja
    [J]. 2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 920 - +
  • [22] Facial Expression Recognition Based on Combination of Spatio-temporal and Spectral Features in Local Facial Regions
    Abounasr, Nakisa
    Pourghassem, Hossein
    [J]. 2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 446 - 450
  • [24] Temporal weights in the perception of sound intensity: Effects of sound duration and number of temporal segments
    Oberfeld, Daniel
    Hots, Jan
    Verhey, Jesko L.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (02): : 943 - 953
  • [25] Word segments in category-based language models for automatic speech recognition
    Justo, Raquel
    Torres, M. Ines
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 1, PROCEEDINGS, 2007, 4477 : 249 - +
  • [26] Affective actions recognition in dyadic interactions based on generative and discriminative models
    Yang, Ning
    Wang, Zhelong
    Zhao, Hongyu
    Li, Jie
    Qiu, Sen
    [J]. SENSOR REVIEW, 2020, 40 (05) : 605 - 615
  • [27] Facial Expression Recognition with Temporal Modeling of Shapes
    Jain, Suyog
    Hu, Changbo
    Aggarwal, J. K.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [28] A Temporal Approach to Facial Emotion Expression Recognition
    Asaju, Christine
    Vadapalli, Hima
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH, SACAIR 2021, 2022, 1551 : 274 - 286
  • [29] Sparse Temporal Representations for Facial Expression Recognition
    Chew, S. W.
    Rana, R.
    Lucey, P.
    Lucey, S.
    Sridharan, S.
    [J]. ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PT II, 2011, 7088 : 311 - +
  • [30] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
    Zhang, Lifeng
    Zheng, Xiangwei
    Chen, Xuanchi
    Ren, Xiuxiu
    Ji, Cun
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6109 - 6124