Sparse landmarks for facial action unit detection using vision transformer and perceiver

被引:0
|
作者
Cakir, Duygu [1 ]
Yilmaz, Gorkem [2 ]
Arica, Nafiz [3 ]
机构
[1] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Software Engn, Istanbul, Turkiye
[2] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Comp Engn, Istanbul, Turkiye
[3] Piri Reis Univ, Fac Engn, Dept Informat Syst Engn, Istanbul, Turkiye
关键词
action unit detection; sparse learning; vision transformer; perceiver; RECOGNITION; PATCHES;
D O I
10.1504/IJCSE.2023.10060451
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.
引用
收藏
页码:607 / 620
页数:15
相关论文
共 50 条
  • [1] Progressive Multi-Scale Vision Transformer for Facial Action Unit Detection
    Wang, Chongwen
    Wang, Zicheng
    FRONTIERS IN NEUROROBOTICS, 2022, 15 (15):
  • [2] Facial Action Unit Detection using 3D Face Landmarks for Pain Detection
    Feghoul, Kevin
    Bouazizi, Mondher
    Maia, Deise Santana
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [3] Enhanced facial action unit detection with adaptable patch sizes on representative landmarks
    Duygu Cakir
    Gorkem Yilmaz
    Nafiz Arica
    Neural Computing and Applications, 2025, 37 (5) : 3777 - 3791
  • [4] Novel Facial Expression Recognition by Combining Action Unit Detection with Sparse Representation Classification
    Su, Te-Feng
    Weng, Ching-Hua
    Lai, Shang-Hong
    39TH ANNUAL IEEE COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2015), VOL 2, 2015, : 719 - 725
  • [5] Facial Action Unit Detection using Variable Decision Thresholds
    Aksoy, Nukhet
    Sert, Mustafa
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 2185 - 2188
  • [6] Facial Action Unit Detection Using Attention and Relation Learning
    Shao, Zhiwen
    Liu, Zhilei
    Cai, Jianfei
    Wu, Yunsheng
    Ma, Lizhuang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1274 - 1289
  • [7] Facial Action Unit Detection With Transformers
    Jacob, Geethu Miriam
    Stenger, Bjorn
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7676 - 7685
  • [8] Relative Facial Action Unit Detection
    Khademi, Mahmoud
    Morency, Louis-Philippe
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 1090 - 1095
  • [9] TransMarker: A Pure Vision Transformer for Facial Landmark Detection
    Wu, Wenyan
    Cai, Yici
    Zhou, Qiang
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3580 - 3587
  • [10] Facial Action Unit Detection Using Kernel Partial Least Squares
    Gehrig, Tobias
    Ekenel, Hazim Kemal
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,