Recognition of facial actions and their temporal segments based on duration models

被引:0
|
作者
Isabel Gonzalez
Francesco Cartella
Valentin Enescu
Hichem Sahli
机构
[1] Vrije Universiteit Brussel (VUB),Department Electronics and Informatics, VUB
[2] Interuniveristy Microelectronics Center (IMEC),NPU Joint AVSP Lab
来源
关键词
Facial action units (AUs); Hidden semi-Markov models (HSMMs); Variable duration semi-Markov model (VDHMM);
D O I
暂无
中图分类号
学科分类号
摘要
Being able to automatically analyze finegrained changes in facial expression into action units (AUs), of the Facial Action Coding System (FACS), and their temporal models (i.e., sequences of temporal phases, neutral, onset, apex, and offset), in face videos would greatly benefit for facial expression recognition systems. Previous works, considered combining, per AU, a discriminative frame-based Support Vector Machine (SVM) and a dynamic generative Hidden Markov Models (HMM), to detect the presence of the AU in question and its temporal segments in an input image sequence. The major drawback of HMMs, is that they do not model well time dependent dynamics as the ones of AUs, especially when dealing with spontaneous expressions. To alleviate this problem, in this paper, we exploit efficient duration modeling of the temporal behavior of AUs, and we propose hidden semi-Markov model (HSMM) and variable duration semi-Markov model (VDHMM) to recognize the dynamics of AU’s. Such models allow the parameterization and inference of the AU’s state duration distributions. Within our system, geometrical and appearance based measurements, as well as their first derivatives, modeling both the dynamics and the appearance of AUs, are applied to pair-wise SVM classifiers for a frame-based classification. The output of which are then fed as evidence to the HSMM or VDHMM for inferring AUs temporal phases. A thorough investigation into the aspect of duration modeling and its application to AU recognition through extensive comparison to state-of-art SVM-HMM approaches are presented. For comparison, an average recognition rate of 64.83 % and 64.66 % is achieved for the HSMM and VDHMM respectively. Our framework has several benefits: (1) it models the AU’s temporal phases duration; (2) it does not require any assumption about the underlying structure of the AU events, and (3) compared to HMM, the proposed HSMM and VDHMM duration models reduce the duration error of the temporal phases of an AU, and they are especially better in recognizing the offset ending of an AU.
引用
收藏
页码:10001 / 10024
页数:23
相关论文
共 50 条
  • [1] Recognition of facial actions and their temporal segments based on duration models
    Gonzalez, Isabel
    Cartella, Francesco
    Enescu, Valentin
    Sahli, Hichem
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 10001 - 10024
  • [2] A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models
    Koelstra, Sander
    Pantic, Maja
    Patras, Ioannis
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (11) : 1940 - 1954
  • [3] Dynamics of facial expression: Recognition of facial actions and their temporal segments from face profile image sequences
    Pantic, Maja
    Patras, Ioannis
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (02): : 433 - 449
  • [4] Fully Automatic Recognition of the Temporal Phases of Facial Actions
    Valstar, Michel F.
    Pantic, Maja
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (01): : 28 - 43
  • [5] Detecting facial actions and their temporal segments in nearly frontal-view face image sequences
    Pantic, M
    Patras, I
    [J]. INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 3358 - 3363
  • [6] Facial Expression Recognition Based on Auxiliary Models
    Wang, Yingying
    Li, Yibin
    Song, Yong
    Rong, Xuewen
    [J]. ALGORITHMS, 2019, 12 (11)
  • [7] EXEMPLAR BASED LANGUAGE RECOGNITION METHOD FOR SHORT-DURATION SPEECH SEGMENTS
    Wang, Meng-Ge
    Song, Yan
    Jiang, Bing
    Dai, Li-Rong
    McLoulghlin, Ian
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7354 - 7358
  • [8] Privacy-preserving facial recognition based on temporal features
    Leong, Shu-Min
    Phan, Raphael C-W
    Baskaran, Vishnu Monn
    Ooi, Chee-Pun
    [J]. APPLIED SOFT COMPUTING, 2020, 96
  • [9] Temporal based Emotion Recognition inspired by Activity Recognition models
    Mohan, Balaganesh
    Popa, Mirela
    [J]. 2021 9TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2021,
  • [10] An expert system for recognition of facial actions and their intensity
    Pantic, M
    Rothkrantz, LJM
    [J]. SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 1026 - 1033