Multi-view facial action unit detection via DenseNets and CapsNets

Cited by: 0
Authors
Ren, Dakai [1]
Wen, Xiangmin [1]
Chen, Jiazhong [2]
Han, Yu [2]
Zhang, Shiqi [2]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
Keywords
Facial action unit; Facial expression recognition; Emotion recognition; CapsNets; DenseNets; Deep learning; Convolutional neural networks; Expression recognition
DOI
10.1007/s11042-021-11147-w
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Although standard convolutional neural networks (CNNs) have been proposed to make facial action unit (AU) detection more robust to pose variations, their detection performance is hard to improve because standard CNNs are not sufficiently robust to affine transformations. To address this issue, this work proposes two novel architectures, termed AUCaps and AUCaps++, for multi-view and multi-label facial AU detection. Each architecture stacks one or more dense blocks and one capsule network (CapsNet). Specifically, the dense blocks placed before the CapsNet are used to learn more discriminative high-level AU features, and the CapsNet is exploited to learn more view-invariant AU features. Moreover, the capsule types and the digit capsule dimension are optimized to avoid the computation and storage burden caused by dynamic routing in standard CapsNets. Because AUCaps and AUCaps++ are trained by jointly optimizing a multi-label AU loss and a reconstruction loss on the viewpoint image, the proposed method can achieve high F1 scores and roughly recover the human face in the reconstructed images across different AUs. Within-dataset and cross-dataset results on two public datasets show that the average F1 scores of the proposed method exceed those of competitors using hand-crafted or deep learning features by a large margin.
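The joint objective mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of either loss, so the sketch below assumes the margin loss and scaled reconstruction loss from the original CapsNet formulation (margins 0.9 and 0.1, down-weighting 0.5, reconstruction weight 0.0005). All function names and default values here are illustrative assumptions, not the authors' implementation.

```python
import math


def squash(s):
    """Squash a capsule vector: direction is kept, length is mapped into [0, 1)."""
    sq_norm = sum(x * x for x in s)
    if sq_norm == 0.0:
        return [0.0 for _ in s]
    scale = sq_norm / (1.0 + sq_norm) / math.sqrt(sq_norm)
    return [scale * x for x in s]


def capsule_length(v):
    """Length of an output capsule, read as the probability that its AU is active."""
    return math.sqrt(sum(x * x for x in v))


def margin_loss(lengths, labels, m_pos=0.9, m_neg=0.1, lam=0.5):
    """Margin loss summed over AU capsules (assumed stand-in for the multi-label AU loss).

    Each AU is scored independently, so several AUs may be active at once.
    """
    total = 0.0
    for length, t in zip(lengths, labels):
        total += t * max(0.0, m_pos - length) ** 2
        total += lam * (1 - t) * max(0.0, length - m_neg) ** 2
    return total


def reconstruction_loss(image, reconstruction):
    """Sum of squared differences between flattened input and reconstructed pixels."""
    return sum((a - b) ** 2 for a, b in zip(image, reconstruction))


def joint_loss(lengths, labels, image, reconstruction, recon_weight=0.0005):
    """Joint objective: AU loss plus a down-weighted reconstruction loss (weight is assumed)."""
    return margin_loss(lengths, labels) + recon_weight * reconstruction_loss(image, reconstruction)
```

For example, `joint_loss([0.5, 0.5], [1, 0], image, image)` penalizes both an under-confident active AU and an over-confident inactive AU, while a perfect reconstruction contributes nothing. The small reconstruction weight keeps the reconstruction term from dominating the AU term.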
Pages: 19377-19394
Number of pages: 18
Related papers
  • [1] Multi-view facial action unit detection via DenseNets and CapsNets
    Dakai Ren
    Xiangmin Wen
    Jiazhong Chen
    Yu Han
    Shiqi Zhang
    [J]. Multimedia Tools and Applications, 2022, 81 : 19377 - 19394
  • [2] Multi-view dynamic facial action unit detection
    Romero, Andres
    Leon, Juan
    Arbelaez, Pablo
    [J]. IMAGE AND VISION COMPUTING, 2022, 122
  • [3] Multi-view facial action unit detection via deep feature enhancement
    Tang, Chuangao
    Lu, Cheng
    Zheng, Wenming
    Zong, Yuan
    Li, Sunan
    [J]. ELECTRONICS LETTERS, 2021, 57 (25) : 970 - 972
  • [4] MMA-Net: Multi-view mixed attention mechanism for facial action unit detection
    Shang, Ziqiao
    Du, Congju
    Li, Bingyin
    Yan, Zengqiang
    Yu, Li
    [J]. PATTERN RECOGNITION LETTERS, 2023, 172 : 165 - 171
  • [5] Graph-Based Multi-Modal Multi-View Fusion for Facial Action Unit Recognition
    Chen, Jianrong
    Dey, Sujit
    [J]. IEEE ACCESS, 2024, 12 : 69310 - 69324
  • [6] Multi-view Surgical Video Action Detection via Mixed Global View Attention
    Schmidt, Adam
    Sharghi, Aidean
    Haugerud, Helene
    Oh, Daniel
    Mohareri, Omid
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT IV, 2021, 12904 : 626 - 635
  • [7] View-Independent Facial Action Unit Detection
    Tang, Chuangao
    Zheng, Wenming
    Yan, Jingwei
    Li, Qiang
    Li, Yang
    Zhang, Tong
    Cui, Zhen
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 878 - 882
  • [8] Multi View Facial Action Unit Detection based on CNN and BLSTM-RNN
    He, Jun
    Li, Dongliang
    Yang, Bin
    Cao, Siming
    Sun, Bo
    Yu, Lejun
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 848 - 853
  • [9] Multi-View Facial Expression Recognition with Multi-View Facial Expression Light Weight Network
    Shao Jie
    Qian Yongsheng
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2020, 30 (04) : 805 - 814