AU R-CNN: Encoding expert prior knowledge into R-CNN for action unit detection

Cited by: 58

Authors:

Ma, Chen [1,2]

Chen, Li [1,2]

Yong, Junhai [1,2]

Affiliations:

[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China

[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing 100084, Peoples R China

Source:

NEUROCOMPUTING | 2019 / Vol. 355

Funding:

National Key Research and Development Program of China; National Natural Science Foundation of China;

Keywords:

Action unit detection; Expert prior knowledge; R-CNN; Facial Action Coding System; MACHINE;

DOI:

10.1016/j.neucom.2019.03.082

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Detecting action units (AUs) on human faces is challenging because various AUs cause subtle changes in facial appearance over various regions at different scales. Current works have attempted to recognize AUs by emphasizing important regions. However, the incorporation of expert prior knowledge into region definition remains under-exploited, and current AU detection approaches do not use regional convolutional neural networks (R-CNN) with expert prior knowledge to directly focus on AU-related regions adaptively. By incorporating expert prior knowledge, we propose a novel R-CNN based model named AU R-CNN. The proposed solution offers two main contributions: (1) AU R-CNN directly observes different facial regions, where various AUs are located. Expert prior knowledge is encoded in the region and the RoI-level label definition. This design produces considerably better detection performance than existing approaches. (2) We integrate various dynamic models (including convolutional long short-term memory, two-stream network, conditional random field, and temporal action localization network) into AU R-CNN and then investigate and analyze the reasons behind the performance of the dynamic models. Experimental results demonstrate that AU R-CNN using only static RGB image information, without optical flow, surpasses the variants fused with dynamic models. AU R-CNN is also superior to traditional CNNs that use the same backbone on varying image resolutions. State-of-the-art recognition performance of AU detection is achieved. The complete network is end-to-end trainable. Experiments on the BP4D and DISFA datasets show the effectiveness of our approach. Code will be made available. © 2019 Elsevier B.V. All rights reserved.
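
The abstract says that expert prior knowledge is encoded in the region and the RoI-level label definition. A minimal sketch of what such an encoding could look like is given below. It assumes a hypothetical three-region partition (`REGION_TO_AUS`) and a hypothetical helper `roi_level_labels`; neither is taken from the paper, whose actual region definitions are not given in this record. Only the AU names in the comments follow the Facial Action Coding System.

```python
from typing import Dict, List, Set

# Hypothetical region -> AU mapping, for illustration only. The paper's actual
# region partition is not stated in the abstract.
REGION_TO_AUS: Dict[str, Set[int]] = {
    "brow": {1, 2, 4},      # inner brow raiser, outer brow raiser, brow lowerer
    "cheek": {6},           # cheek raiser
    "mouth": {12, 15, 17},  # lip corner puller, lip corner depressor, chin raiser
}

# Fixed ordering of AUs, so every region gets a label vector of the same length.
AU_ORDER: List[int] = [1, 2, 4, 6, 12, 15, 17]


def roi_level_labels(image_aus: Set[int]) -> Dict[str, List[int]]:
    """Turn image-level active AUs into one binary label vector per region.

    An AU is marked active for a region only if it is active in the image
    AND the prior-knowledge mapping says it can occur in that region.
    """
    labels: Dict[str, List[int]] = {}
    for region, allowed in REGION_TO_AUS.items():
        active = image_aus & allowed
        labels[region] = [1 if au in active else 0 for au in AU_ORDER]
    return labels


# Example: an image annotated with AU4 (brow lowerer) and AU12 (lip corner puller).
labels = roi_level_labels({4, 12})
```

Under these assumptions, each region receives its own multi-label target, so a region-based detector head would only be asked to predict the AUs that can plausibly appear in that region.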


Pages: 35-47

Page count: 13

