Sparse landmarks for facial action unit detection using vision transformer and perceiver

被引:0
|
作者
Cakir, Duygu [1 ]
Yilmaz, Gorkem [2 ]
Arica, Nafiz [3 ]
机构
[1] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Software Engn, Istanbul, Turkiye
[2] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Comp Engn, Istanbul, Turkiye
[3] Piri Reis Univ, Fac Engn, Dept Informat Syst Engn, Istanbul, Turkiye
关键词
action unit detection; sparse learning; vision transformer; perceiver; RECOGNITION; PATCHES;
D O I
10.1504/IJCSE.2023.10060451
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.
引用
收藏
页码:607 / 620
页数:15
相关论文
共 50 条
  • [31] Fall Event Detection using Vision Transformer
    Dey, Ankita
    Rajan, Sreeraman
    Xiao, George
    Lu, Jianping
    2022 IEEE SENSORS, 2022,
  • [32] Pupil Detection Using Hybrid Vision Transformer
    Wang, Li
    Wang, Changyuan
    Zhang, Yu
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (12)
  • [33] A Novel Machine Vision-Based 3D Facial Action Unit Identification for Fatigue Detection
    Sikander, Gulbadan
    Anwar, Shahzad
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (05) : 2730 - 2740
  • [34] Lightweight network architecture using difference saliency maps for facial action unit detection
    Jing Chen
    Chenhui Wang
    Kejun Wang
    Meichen Liu
    Applied Intelligence, 2022, 52 : 6354 - 6375
  • [35] Lightweight network architecture using difference saliency maps for facial action unit detection
    Chen, Jing
    Wang, Chenhui
    Wang, Kejun
    Liu, Meichen
    APPLIED INTELLIGENCE, 2022, 52 (06) : 6354 - 6375
  • [36] Automated pain detection using STA-LSTM on facial landmarks
    Du, Tiehua
    Choo, Keng Wah
    Kong, Wai Ming
    Tan, Chin Wen
    Teo, Jing Chun
    Chan, Diana
    Sng, Ban Leong
    2024 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE, ISMSI 2024, 2024, : 36 - 40
  • [37] Drivers' Visual Distraction Detection Using Facial Landmarks and Head Pose
    Zhang, Shile
    Abdel-Aty, Mohamed
    TRANSPORTATION RESEARCH RECORD, 2022, 2676 (09) : 491 - 501
  • [38] Expression Empowered ResiDen Network for Facial Action Unit Detection
    Jyoti, Shreyank
    Sharma, Garima
    Dhall, Abhinav
    2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 262 - 269
  • [39] AUFormer: Vision Transformers Are Parameter-Efficient Facial Action Unit Detectors
    Yuan, Kaishen
    Yu, Zitong
    Liu, Xin
    Xie, Weicheng
    Yue, Huanjing
    Yang, Jingyu
    COMPUTER VISION - ECCV 2024, PT L, 2025, 15108 : 427 - 445
  • [40] Facial Action Unit Detection via Adaptive Attention and Relation
    Shao, Zhiwen
    Zhou, Yong
    Cai, Jianfei
    Zhu, Hancheng
    Yao, Rui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3354 - 3366