Deep Structured Learning for Facial Action Unit Intensity Estimation

被引:53
|
作者
Walecki, Robert [1 ]
Rudovic, Ognjen [2 ]
Pavlovic, Vladimir [3 ]
Schuller, Bjoern [1 ]
Pantic, Maja [1 ,4 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
[2] MIT, Media Lab, Cambridge, MA 02139 USA
[3] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ USA
[4] Univ Twente, EEMCS, Enschede, Netherlands
基金
美国国家科学基金会; 欧盟地平线“2020”;
关键词
D O I
10.1109/CVPR.2017.605
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the task of automated estimation of facial expression intensity. This involves estimation of multiple output variables (facial action units - AUs) that are structurally dependent. Their structure arises from statistically induced co-occurrence patterns of AU intensity levels. Modeling this structure is critical for improving the estimation performance; however, this performance is bounded by the quality of the input features extracted from face images. The goal of this paper is to model these structures and estimate complex feature representations simultaneously by combining conditional random field (CRF) encoded AU dependencies with deep learning. To this end, we propose a novel Copula CNN deep learning approach for modeling multivariate ordinal variables. Our model accounts for ordinal structure in output variables and their non-linear dependencies via copula functions modeled as cliques of a CRF. These are jointly optimized with deep CNN feature encoding layers using a newly introduced balanced batch iterative training algorithm. We demonstrate the effectiveness of our approach on the task of AU intensity estimation on two benchmark datasets. We show that joint learning of the deep features and the target output structure results in significant performance gains compared to existing deep structured models for analysis of facial expressions.
引用
收藏
页码:5709 / 5718
页数:10
相关论文
共 50 条
  • [1] Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation
    Zhang, Yong
    Wu, Baoyuan
    Dong, Weiming
    Li, Zhifeng
    Liu, Wei
    Hu, Bao-Gang
    Ji, Qiang
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3452 - 3461
  • [2] Weakly-supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation
    Zhang, Yong
    Dong, Weiming
    Hu, Bao-Gang
    Ji, Qiang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2314 - 2323
  • [3] Deep Learning based FACS Action Unit Occurrence and Intensity Estimation
    Gudi, Amogh
    Tasli, H. Emrah
    den Uyl, Tim M.
    Maroulis, Andreas
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 6, 2015,
  • [4] Deep Facial Action Unit Recognition and Intensity Estimation from Partially Labelled Data
    Wang, Shangfei
    Pan, Bowen
    Wu, Shan
    Ji, Qiang
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (04) : 1018 - 1030
  • [5] Facial Action Unit Intensity Estimation via Semantic Correspondence Learning with Dynamic Graph Convolution
    Fan, Yingruo
    Lam, Jacqueline C. K.
    Li, Victor O. K.
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12701 - 12708
  • [6] Unsupervised Learning Facial Parameter Regressor for Action Unit Intensity Estimation via Differentiable Renderer
    Song, Xinhui
    Shi, Tianyang
    Feng, Zunlei
    Song, Mingli
    Lin, Jackie
    Lin, Chuanjie
    Fan, Changjie
    Yuan, Yi
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2842 - 2851
  • [7] Dynamic Probabilistic Graph Convolution for Facial Action Unit Intensity Estimation
    Song, Tengfei
    Cui, Zijun
    Wang, Yuru
    Zheng, Wenming
    Ji, Qiang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4843 - 4852
  • [8] A Framework for Joint Estimation and Guided Annotation of Facial Action Unit Intensity
    Walecki, Robert
    Rudovic, Ognjen
    Pantic, Maja
    Pavlovic, Vladimir
    Cohn, Jeffrey F.
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 1460 - 1468
  • [9] Markov Random Field Structures for Facial Action Unit Intensity Estimation
    Sandbach, Georgia
    Zafeiriou, Stefanos
    Pantic, Maja
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 738 - 745
  • [10] Copula Ordinal Regression for Joint Estimation of Facial Action Unit Intensity
    Walecki, Robert
    Rudovic, Ognjen
    Pavlovic, Vladimir
    Pantic, Maja
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4902 - 4910