Facial Action Unit Detection via Adaptive Attention and Relation

Cited by: 1
Authors
Shao, Zhiwen [1, 2, 3]
Zhou, Yong [1, 2]
Cai, Jianfei [4]
Zhu, Hancheng [1, 2]
Yao, Rui [1, 2]
Affiliations
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
[2] Minist Educ Peoples Republ China, Engn Res Ctr Mine Digitizat, Xuzhou 221116, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[4] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
Funding
National Natural Science Foundation of China
Keywords
Facial AU detection; adaptive attention regression network; adaptive spatio-temporal graph convolutional network
DOI
10.1109/TIP.2023.3277794
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Facial action unit (AU) detection is challenging due to the difficulty in capturing correlated information from subtle and dynamic AUs. Existing methods often resort to the localization of correlated regions of AUs, in which predefining local AU attentions by correlated facial landmarks often discards essential parts, or learning global attention maps often contains irrelevant areas. Furthermore, existing relational reasoning methods often employ common patterns for all AUs while ignoring the specific way of each AU. To tackle these limitations, we propose a novel adaptive attention and relation (AAR) framework for facial AU detection. Specifically, we propose an adaptive attention regression network to regress the global attention map of each AU under the constraint of attention predefinition and the guidance of AU detection, which is beneficial for capturing both specified dependencies by landmarks in strongly correlated regions and facial globally distributed dependencies in weakly correlated regions. Moreover, considering the diversity and dynamics of AUs, we propose an adaptive spatio-temporal graph convolutional network to simultaneously reason the independent pattern of each AU, the inter-dependencies among AUs, as well as the temporal dependencies. Extensive experiments show that our approach (i) achieves competitive performance on challenging benchmarks including BP4D, DISFA, and GFT in constrained scenarios and Aff-Wild2 in unconstrained scenarios, and (ii) can precisely learn the regional correlation distribution of each AU.
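The "attention predefinition" mentioned in the abstract can be pictured with a small sketch. This is not the paper's adaptive attention regression network; it only illustrates the general idea of a landmark-based prior map that such a network could be constrained by. The Gaussian shape, the `sigma` value, the map size, the landmark coordinates, and the function name `predefined_attention` are all assumptions made for illustration.

```python
import math


def predefined_attention(centers, height, width, sigma=3.0):
    """Build a prior attention map for one AU (illustrative assumption).

    centers: list of (row, col) positions derived from facial landmarks.
    Each center contributes a Gaussian bump; overlapping bumps are merged
    with an element-wise maximum so values stay in [0, 1].
    """
    attention = [[0.0] * width for _ in range(height)]
    for r0, c0 in centers:
        for r in range(height):
            for c in range(width):
                d2 = (r - r0) ** 2 + (c - c0) ** 2
                value = math.exp(-d2 / (2.0 * sigma ** 2))
                attention[r][c] = max(attention[r][c], value)
    return attention


# Hypothetical AU with two symmetric centers on a 44x44 feature map.
prior = predefined_attention([(10, 12), (10, 32)], height=44, width=44)
```

A map like this is highest near the chosen landmarks and decays toward zero elsewhere, which matches the abstract's point that a purely predefined attention can discard weakly correlated regions unless it is further adapted.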
Pages: 3354-3366
Page count: 13