Deep Structured Learning for Facial Action Unit Intensity Estimation

Cited by: 53
Authors
Walecki, Robert [1]
Rudovic, Ognjen [2]
Pavlovic, Vladimir [3]
Schuller, Bjoern [1]
Pantic, Maja [1, 4]
Affiliations
[1] Imperial Coll London, Dept Comp, London, England
[2] MIT, Media Lab, Cambridge, MA 02139 USA
[3] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ USA
[4] Univ Twente, EEMCS, Enschede, Netherlands
Funding
US National Science Foundation; EU Horizon 2020
DOI
10.1109/CVPR.2017.605
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We consider the task of automated estimation of facial expression intensity. This involves estimation of multiple output variables (facial action units - AUs) that are structurally dependent. Their structure arises from statistically induced co-occurrence patterns of AU intensity levels. Modeling this structure is critical for improving the estimation performance; however, this performance is bounded by the quality of the input features extracted from face images. The goal of this paper is to model these structures and estimate complex feature representations simultaneously by combining conditional random field (CRF) encoded AU dependencies with deep learning. To this end, we propose a novel Copula CNN deep learning approach for modeling multivariate ordinal variables. Our model accounts for ordinal structure in output variables and their non-linear dependencies via copula functions modeled as cliques of a CRF. These are jointly optimized with deep CNN feature encoding layers using a newly introduced balanced batch iterative training algorithm. We demonstrate the effectiveness of our approach on the task of AU intensity estimation on two benchmark datasets. We show that joint learning of the deep features and the target output structure results in significant performance gains compared to existing deep structured models for analysis of facial expressions.
Pages: 5709 - 5718
Page count: 10
Related papers
50 in total
  • [31] Semantic Learning for Facial Action Unit Detection
    Wang, Xuehan
    Chen, C. L. Philip
    Yuan, Haozhang
    Zhang, Tong
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 1372 - 1380
  • [32] Facial Action Unit Intensity Detection by Extracting Complimentary Information using Distance Metric Learning
    Rathee, Neeru
    Ganotra, Dinesh
    Rathee, Ajay
    [J]. IETE JOURNAL OF RESEARCH, 2020, 66 (02) : 214 - 223
  • [33] Edge Convolutional Network for Facial Action Intensity Estimation
    Li, Liandong
    Baltrusaitis, Tadas
    Sun, Bo
    Morency, Louis-Philippe
    [J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 171 - 178
  • [34] JOINT FACIAL ACTION UNIT INTENSITY PREDICTION AND REGION LOCALISATION
    Fan, Yachun
    Shen, Jie
    Cheng, Housen
    Tian, Feng
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [35] FATAUVA-Net : An Integrated Deep Learning Framework for Facial Attribute Recognition, Action Unit Detection, and Valence-Arousal Estimation
    Chang, Wei-Yi
    Hsu, Shih-Huan
    Chien, Jen-Hsien
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1963 - 1971
  • [36] Semi-Supervised Deep Neural Network for Joint Intensity Estimation of Multiple Facial Action Units
    Zhang, Yong
    Fan, Yanbo
    Dong, Weiming
    Hu, Bao-Gang
    Ji, Qiang
    [J]. IEEE ACCESS, 2019, 7 : 150743 - 150756
  • [37] Ordinal Deep Learning for Facial Age Estimation
    Liu, Hao
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (02) : 486 - 501
  • [38] Meta Auxiliary Learning for Facial Action Unit Detection
    Li, Yong
    Shan, Shiguang
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2526 - 2538
  • [39] LEARNING EXPRESSION KERNELS FOR FACIAL EXPRESSION INTENSITY ESTIMATION
    Liao, Chia-Te
    Chuang, Hui-Ju
    Lai, Shang-Hong
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2217 - 2220
  • [40] Regression-based intensity estimation of facial action units
    Savran, Arman
    Sankur, Bulent
    Bilge, M. Taha
    [J]. IMAGE AND VISION COMPUTING, 2012, 30 (10) : 774 - 784