Facial Action Unit Recognition Based on Self-Attention Spatiotemporal Fusion

被引:0
|
作者
Liang, Chaolei [1 ]
Zou, Wei [1 ]
Hu, Danfeng [1 ]
Wang, JiaJun [1 ]
机构
[1] Sch Elect & Informat Engn, Suzhou, Peoples R China
关键词
Facial action unit; Graph Convolutional Neural Network; Attention mechanism; Spatio-temporal relation;
D O I
10.1145/3670105.3670210
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Facial Action Units (AUs) serve as a precise descriptor of facial expressions, revealing an individual's psychological and mental state. Therefore, AU detection plays important roles in facial expression recognition. Existing methods often focus on extracting intra-frame information while pay less attention to inter-frame feature changes. To address this issue, this paper proposes a self-attention spatiotemporal fusion method (SAtt-STPN). In this method, a feature extractor (AFE) is specifically designed to extract uniform feature information from both strongly and weakly correlated regions. A spatiotemporal perception (STP) module is specifically designed to capture temporal information for each AU through mutually-driven independent branches in both spatial and temporal dimensions while a graph convolutional network is adopted to model intra-frame AU relationships (ARM). Ultimately, intra-frame and inter-frame information are weighted and fused for classification. Experimental results on two public datasets (BP4D and DISFA) show that the our proposed SAtt-STPN outperforms state-of-the-art methods in facial AU detection.
引用
收藏
页码:600 / 605
页数:6
相关论文
共 50 条
  • [31] Multimodal Fusion Method Based on Self-Attention Mechanism
    Zhu, Hu
    Wang, Ze
    Shi, Yu
    Hua, Yingying
    Xu, Guoxia
    Deng, Lizhen
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [32] Feature Level Fusion for Bimodal Facial Action Unit Recognition
    Meng, Zibo
    Han, Shizhong
    Chen, Min
    Tong, Yan
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 471 - 476
  • [33] Texture and shape information fusion for facial expression and facial action unit recognition
    Kotsia, Irene
    Zafeiriou, Stefanos
    Pitas, Loannis
    PATTERN RECOGNITION, 2008, 41 (03) : 833 - 851
  • [34] Polarimetric HRRP Recognition Based on ConvLSTM With Self-Attention
    Zhang, Liang
    Li, Yang
    Wang, Yanhua
    Wang, Junfu
    Long, Teng
    IEEE SENSORS JOURNAL, 2021, 21 (06) : 7884 - 7898
  • [35] Micro-expression Recognition Based on Facial Graph Representation Learning and Facial Action Unit Fusion
    Lei, Ling
    Chen, Tong
    Li, Shigang
    Li, Jianfeng
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1571 - 1580
  • [36] Finger Vein Recognition Based on ResNet With Self-Attention
    Zhang, Zhibo
    Chen, Guanghua
    Zhang, Weifeng
    Wang, Huiyang
    IEEE ACCESS, 2024, 12 : 1943 - 1951
  • [37] Local Multi-Head Channel Self-Attention for Facial Expression Recognition
    Pecoraro, Roberto
    Basile, Valerio
    Bono, Viviana
    INFORMATION, 2022, 13 (09)
  • [38] Neural network based on convolution and self-attention fusion mechanism for plant leaves disease recognition
    Zhao, Yun
    Li, Yang
    Wu, Na
    Xu, Xing
    CROP PROTECTION, 2024, 180
  • [39] SPATIO-TEMPORAL SLOWFAST SELF-ATTENTION NETWORK FOR ACTION RECOGNITION
    Kim, Myeongjun
    Kim, Taehun
    Kim, Daijin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2206 - 2210
  • [40] Self-Attention Pooling-Based Long-Term Temporal Network for Action Recognition
    Li, Huifang
    Huang, Jingwei
    Zhou, Mengchu
    Shi, Qisong
    Fei, Qing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 65 - 77