AU R-CNN: Encoding expert prior knowledge into R-CNN for action unit detection

被引:58
|
作者
Ma, Chen [1 ,2 ]
Chen, Li [1 ,2 ]
Yong, Junhai [1 ,2 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing 100084, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Action unit detection; Expert prior knowledge; R-CNN; Facial Action Coding System; MACHINE;
D O I
10.1016/j.neucom.2019.03.082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting action units (AUs) on human faces is challenging because various AUs make subtle facial appearance change over various regions at different scales. Current works have attempted to recognize AUs by emphasizing important regions. However, the incorporation of expert prior knowledge into region definition remains under-exploited, and current AU detection approaches do not use regional convolutional neural networks (R-CNN) with expert prior knowledge to directly focus on AU-related regions adaptively. By incorporating expert prior knowledge, we propose a novel R-CNN based model named AU R-CNN. The proposed solution offers two main contributions: (1) AU R-CNN directly observes different facial regions, where various AUs are located. Expert prior knowledge is encoded in the region and the Rol-level label definition. This design produces considerably better detection performance than existing approaches. (2) We integrate various dynamic models (including convolutional long short-term memory, two stream network, conditional random field, and temporal action localization network) into AU R-CNN and then investigate and analyze the reason behind the performance of dynamic models. Experiment results demonstrate that only static RGB image information and no optical flow-based AU R-CNN surpasses the one fused with dynamic models. AU R-CNN is also superior to traditional CNNs that use the same backbone on varying image resolutions. State-of-the-art recognition performance of AU detection is achieved. The complete network is end-to-end trainable. Experiments on BP4D and DISFA datasets show the effectiveness of our approach. Code will be made available. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:35 / 47
页数:13
相关论文
共 50 条
  • [1] ME R-CNN: Multi-Expert R-CNN for Object Detection
    Lee, Hyungtae
    Eum, Sungmin
    Kwon, Heesung
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1030 - 1044
  • [2] Crack Detection and Comparison Study Based on Faster R-CNN and Mask R-CNN
    Xu, Xiangyang
    Zhao, Mian
    Shi, Peixin
    Ren, Ruiqi
    He, Xuhui
    Wei, Xiaojun
    Yang, Hao
    [J]. SENSORS, 2022, 22 (03)
  • [3] IEMask R-CNN: Information-Enhanced Mask R-CNN
    Bi, Xiuli
    Hu, Jinwu
    Xiao, Bin
    Li, Weisheng
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (02) : 688 - 700
  • [4] Nuclei R-CNN: Improve Mask R-CNN for Nuclei Segmentation
    Lv, Guofeng
    Wen, Ke
    Wu, Zheng
    Jin, Xu
    An, Hong
    He, Jie
    [J]. 2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 357 - 362
  • [5] SE-Mask R-CNN: An improved Mask R-CNN for apple detection and segmentation
    Liu, Yikun
    Yang, Gongping
    Huang, Yuwen
    Yin, Yilong
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 6715 - 6725
  • [6] Mask R-CNN
    He, Kaiming
    Gkioxari, Georgia
    Dollar, Piotr
    Girshick, Ross
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2980 - 2988
  • [7] Mesh R-CNN
    Gkioxari, Georgia
    Malik, Jitendra
    Johnson, Justin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9784 - 9794
  • [8] Grid R-CNN
    Lu, Xin
    Li, Buyu
    Yue, Yuxin
    Li, Quanquan
    Yan, Junjie
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7355 - 7364
  • [9] Oriented R-CNN for Object Detection
    Xie, Xingxing
    Cheng, Gong
    Wang, Jiabao
    Yao, Xiwen
    Han, Junwei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3500 - 3509
  • [10] Face Detection with the Faster R-CNN
    Jiang, Huaizu
    Learned-Miller, Erik
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 650 - 657