Semi-Supervised Deep Neural Network for Joint Intensity Estimation of Multiple Facial Action Units

被引:1
|
作者
Zhang, Yong [1 ]
Fan, Yanbo [1 ]
Dong, Weiming [2 ]
Hu, Bao-Gang [2 ]
Ji, Qiang [3 ]
机构
[1] Tencent AI Lab, Shenzhen 518057, Guangdong, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[3] Rensselaer Polytech Inst, Dept Elect Comp & Syst Engn, Troy, NY 12180 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Gold; Estimation; Hidden Markov models; Training; Face; Task analysis; Neural networks; Facial action units; intensity estimation; deep learning; weakly supervised learning; TRACKING; MODEL;
D O I
10.1109/ACCESS.2019.2947201
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Facial action units (AUs) are defined to depict movements of facial muscles, which are basic elements to encode facial expressions. Automatic AU intensity estimation is an important task in affective computing. Previous works leverage the representation power of deep neural networks (DNNs) to improve the performance of intensity estimation. However, a large number of intensity annotations are required to train DNNs that contain millions of parameters. But it is expensive and difficult to build a large-scale database with AU intensity annotation since AU annotation requires annotators have strong domain expertise. We propose a novel semi-supervised deep convolutional network that leverages extremely limited AU annotations for AU intensity estimation. It requires only intensity annotations of keyframes of training sequences. Domain knowledge on AUs is leveraged to provide weak supervisory information, including relative appearance similarity, temporal intensity ordering, facial symmetry, and contrastive appearance difference. We also propose a strategy to train a model for joint intensity estimation of multiple AUs under the setting of semi-supervised learning, which greatly improves the efficiency during inference. We perform empirical experiments on two public benchmark expression databases and make comparisons with state-of-the-art methods to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:150743 / 150756
页数:14
相关论文
共 50 条
  • [1] Weakly-supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation
    Zhang, Yong
    Dong, Weiming
    Hu, Bao-Gang
    Ji, Qiang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2314 - 2323
  • [2] Trend-Aware Supervision: On Learning Invariance for Semi-Supervised Facial Action Unit Intensity Estimation
    Chen, Yingjie
    Zhang, Jiarui
    Wang, Tao
    Liang, Yun
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 483 - 491
  • [3] Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation
    Hong, Seunghoon
    Noh, Hyeonwoo
    Han, Bohyung
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [4] On the Importance of Stereo for Accurate Depth Estimation: An Efficient Semi-Supervised Deep Neural Network Approach
    Smolyanskiy, Nikolai
    Kamenev, Alexey
    Birchfield, Stan
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1120 - 1128
  • [5] Estimation of Interaction Forces in Robotic Surgery using a Semi-Supervised Deep Neural Network Model
    Marban, Arturo
    Srinivasan, Vignesh
    Samek, Wojciech
    Fernandez, Josep
    Casals, Alicia
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 761 - 768
  • [6] Multi-softmax Deep Neural Network for Semi-supervised Training
    Su, Hang
    Xu, Haihua
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3239 - 3243
  • [7] Facial landmark detection by semi-supervised deep learning
    Tang, Xin
    Guo, Fang
    Shen, Jianbing
    Du, Tianyuan
    [J]. NEUROCOMPUTING, 2018, 297 : 22 - 32
  • [8] Ordinal information based facial expression intensity estimation for emotional interaction: a novel semi-supervised deep learning approach
    Xu, Ruyi
    Han, Jiaxu
    Chen, Jingying
    [J]. COMPUTING, 2024, 106 (04) : 1121 - 1138
  • [9] Ordinal information based facial expression intensity estimation for emotional interaction: a novel semi-supervised deep learning approach
    Ruyi Xu
    Jiaxu Han
    Jingying Chen
    [J]. Computing, 2024, 106 : 1121 - 1138
  • [10] SEMI-SUPERVISED TRAINING OF DEEP NEURAL NETWORKS
    Vesely, Karel
    Hannemann, Mirko
    Burget, Lukas
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 267 - 272