SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

被引:2
|
作者
Li, Zhihua [1 ]
Zhang, Zheng [1 ]
Yin, Lijun [1 ]
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICPR48806.2021.9413260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research on facial action unit detection has shown remarkable performances by using deep spatial learning models in recent years, however, it is far from reaching its full capacity in learning due to the lack of use of temporal information of AUs across time. Since the AU occurrence in one frame is highly likely related to previous frames in a temporal sequence, exploring temporal correlation of AUs across frames becomes a key motivation of this work. In this paper, we propose a novel temporal fusion and AU-supervised self-attention network (a so-called SAT-Net) to address the AU detection problem. First of all, we input the deep features of a sequence into a convolutional LSTM network and fuse the previous temporal information into the feature map of the last frame, and continue to learn the AU occurrence. Second, considering the AU detection problem is a multi-label classification problem that individual label depends only on certain facial areas, we propose a new self-learned attention mask by focusing the detection of each AU on parts of facial areas through the learning of individual attention mask for each AU, thus increasing the AU independence without the loss of any spatial relations. Our extensive experiments show that the proposed framework achieves better results of AU detection over the state-of-the-arts on two benchmark databases (BP4D and DISFA).
引用
收藏
页码:5036 / 5043
页数:8
相关论文
共 50 条
  • [21] Multimodal Depression Detection Based on Self-Attention Network With Facial Expression and Pupil
    Liu, Xiang
    Shen, Hao
    Li, Huiru
    Tao, Yongfeng
    Yang, Minqiang
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [22] Dual-attention guided network for facial action unit detection
    Song, Wenyu
    Shi, Shuze
    Wu, Yuxuan
    An, Gaoyun
    [J]. IET IMAGE PROCESSING, 2022, 16 (08) : 2157 - 2170
  • [23] Wireless Link Quality Prediction Based on Temporal Convolutional Networks and Self-Attention Fusion
    Wang, Yao
    Liu, Linlan
    [J]. 2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 448 - 453
  • [24] Multi-level feature fusion capsule network with self-attention for facial expression recognition
    Huang, Zhiji
    Yu, Songsen
    Liang, Jun
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
  • [25] J(A)over-capA-Net: Joint Facial Action Unit Detection and Face Alignment Via Adaptive Attention
    Shao, Zhiwen
    Liu, Zhilei
    Cai, Jianfei
    Ma, Lizhuang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 321 - 340
  • [26] A visual self-attention network for facial expression recognition
    Yu, Naigong
    Bai, Deguo
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [27] A self-attention network for smoke detection
    Jiang, Minghua
    Zhao, Yaxin
    Yu, Feng
    Zhou, Changlong
    Peng, Tao
    [J]. FIRE SAFETY JOURNAL, 2022, 129
  • [28] EFFECT OF SELF-ATTENTION ON HEARTBEAT DETECTION
    HODAPP, V
    [J]. JOURNAL OF PSYCHOPHYSIOLOGY, 1995, 9 (03) : 280 - 280
  • [29] A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism
    Gao, Pengcheng
    Lu, Ke
    Xue, Jian
    Shao, Ling
    Lyu, Jiayi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 926 - 938
  • [30] Spatio-Temporal Self-Attention Weighted VLAD Neural Network for Action Recognition
    Cheng, Shilei
    Xie, Mei
    Ma, Zheng
    Li, Siqi
    Gu, Song
    Yang, Feng
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (01) : 220 - 224