SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

Cited by: 2

Authors:

Li, Zhihua [1]

Zhang, Zheng [1]

Yin, Lijun [1]

Affiliations:

[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA

Source:

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021

Funding:

National Science Foundation (USA)

Keywords:

DOI:

10.1109/ICPR48806.2021.9413260

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Research on facial action unit (AU) detection has achieved remarkable performance in recent years by using deep spatial learning models. However, it is far from reaching its full learning capacity because temporal information about AUs is rarely used. Since the occurrence of an AU in one frame is highly likely to be related to previous frames in a temporal sequence, exploring the temporal correlation of AUs across frames is a key motivation of this work. In this paper, we propose a novel temporal fusion and AU-supervised self-attention network (called SAT-Net) to address the AU detection problem. First, we feed the deep features of a sequence into a convolutional LSTM network, fuse the previous temporal information into the feature map of the last frame, and continue to learn the AU occurrence. Second, considering that AU detection is a multi-label classification problem in which each individual label depends only on certain facial areas, we propose a new self-learned attention mask: an individual attention mask is learned for each AU, focusing the detection of that AU on the relevant facial areas and thus increasing AU independence without losing any spatial relations. Our extensive experiments show that the proposed framework achieves better AU detection results than state-of-the-art methods on two benchmark databases (BP4D and DISFA).
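
The per-AU attention idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`attention_pool`, `detect_aus`), the sigmoid mask, the mask-weighted average pooling, and the per-AU linear classifier are all assumptions made for illustration, and the convolutional LSTM fusion step is not shown. The feature map here stands in for the fused feature map of the last frame.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def attention_pool(feature_map, mask_logits):
    """Pool a C x H x W feature map using one H x W soft attention mask.

    Hypothetical helper: returns a length-C vector where each channel is the
    mask-weighted average of its spatial values.
    """
    mask = [[sigmoid(v) for v in row] for row in mask_logits]
    total = sum(sum(row) for row in mask)
    pooled = []
    for channel in feature_map:
        weighted = sum(
            channel[i][j] * mask[i][j]
            for i in range(len(mask))
            for j in range(len(mask[0]))
        )
        pooled.append(weighted / total)
    return pooled


def detect_aus(feature_map, masks_per_au, weights_per_au, biases_per_au):
    """Return one occurrence probability per AU (multi-label output).

    Assumption: each AU has its own mask and its own linear classifier.
    """
    probs = []
    for mask_logits, w, b in zip(masks_per_au, weights_per_au, biases_per_au):
        pooled = attention_pool(feature_map, mask_logits)
        score = sum(wi * pi for wi, pi in zip(w, pooled)) + b
        probs.append(sigmoid(score))
    return probs


# Toy input: one channel, 2 x 2 spatial grid, two AUs with different masks.
features = [[[1.0, 0.0], [0.0, 0.0]]]
masks = [
    [[20.0, -20.0], [-20.0, -20.0]],  # AU 0 attends to the top-left cell
    [[0.0, 0.0], [0.0, 0.0]],         # AU 1 attends uniformly
]
probs = detect_aus(features, masks, [[2.0], [2.0]], [0.0, 0.0])
```

In this toy setup both AUs share the same feature map and classifier weights, so any difference between the two probabilities comes only from the masks, which is the sense in which a per-AU mask lets each label depend on its own facial region.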


Pages: 5036-5043

Page count: 8

50 items in total

[21] Multimodal Depression Detection Based on Self-Attention Network With Facial Expression and Pupil
Liu, Xiang
Shen, Hao
Li, Huiru
Tao, Yongfeng
Yang, Minqiang
[J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
[22] Dual-attention guided network for facial action unit detection
Song, Wenyu
Shi, Shuze
Wu, Yuxuan
An, Gaoyun
[J]. IET IMAGE PROCESSING, 2022, 16 (08) : 2157 - 2170
[23] Wireless Link Quality Prediction Based on Temporal Convolutional Networks and Self-Attention Fusion
Wang, Yao
Liu, Linlan
[J]. 2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 448 - 453
[24] Multi-level feature fusion capsule network with self-attention for facial expression recognition
Huang, Zhiji
Yu, Songsen
Liang, Jun
[J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
[25] JÂA-Net: Joint Facial Action Unit Detection and Face Alignment Via Adaptive Attention
Shao, Zhiwen
Liu, Zhilei
Cai, Jianfei
Ma, Lizhuang
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 321 - 340
[26] A visual self-attention network for facial expression recognition
Yu, Naigong
Bai, Deguo
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[27] A self-attention network for smoke detection
Jiang, Minghua
Zhao, Yaxin
Yu, Feng
Zhou, Changlong
Peng, Tao
[J]. FIRE SAFETY JOURNAL, 2022, 129
[28] EFFECT OF SELF-ATTENTION ON HEARTBEAT DETECTION
HODAPP, V
[J]. JOURNAL OF PSYCHOPHYSIOLOGY, 1995, 9 (03) : 280 - 280
[29] A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism
Gao, Pengcheng
Lu, Ke
Xue, Jian
Shao, Ling
Lyu, Jiayi
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 926 - 938
[30] Spatio-Temporal Self-Attention Weighted VLAD Neural Network for Action Recognition
Cheng, Shilei
Xie, Mei
Ma, Zheng
Li, Siqi
Gu, Song
Yang, Feng
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (01) : 220 - 224
