Sparse landmarks for facial action unit detection using vision transformer and perceiver

被引:0
|
作者
Cakir, Duygu [1 ]
Yilmaz, Gorkem [2 ]
Arica, Nafiz [3 ]
机构
[1] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Software Engn, Istanbul, Turkiye
[2] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Comp Engn, Istanbul, Turkiye
[3] Piri Reis Univ, Fac Engn, Dept Informat Syst Engn, Istanbul, Turkiye
关键词
action unit detection; sparse learning; vision transformer; perceiver; RECOGNITION; PATCHES;
D O I
10.1504/IJCSE.2023.10060451
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.
引用
收藏
页码:607 / 620
页数:15
相关论文
共 50 条
  • [21] Meta Auxiliary Learning for Facial Action Unit Detection
    Li, Yong
    Shan, Shiguang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2526 - 2538
  • [22] Stacking multiple cues for facial action unit detection
    Simge Akay
    Nafiz Arica
    The Visual Computer, 2022, 38 : 4235 - 4250
  • [23] Confidence Preserving Machine for Facial Action Unit Detection
    Zeng, Jiabei
    Chu, Wen-Sheng
    De la Torre, Fernando
    Cohn, Jeffrey F.
    Xiong, Zhang
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3622 - 3630
  • [24] Stacking multiple cues for facial action unit detection
    Akay, Simge
    Arica, Nafiz
    VISUAL COMPUTER, 2022, 38 (12): : 4235 - 4250
  • [25] Confidence Preserving Machine for Facial Action Unit Detection
    Zeng, Jiabei
    Chu, Wen-Sheng
    De la Torre, Fernando
    Cohn, Jeffrey F.
    Xiong, Zhang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (10) : 4753 - 4767
  • [26] Enhanced Facial Emotion Recognition Using Vision Transformer Models
    Fatima, N. Sabiyath
    Deepika, G.
    Anthonisamy, Arun
    Chitra, R. Jothi
    Muralidharan, J.
    Alagarsamy, Manjunathan
    Ramyasree, Kummari
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2025, 20 (02) : 1143 - 1152
  • [27] View-Independent Facial Action Unit Detection
    Tang, Chuangao
    Zheng, Wenming
    Yan, Jingwei
    Li, Qiang
    Li, Yang
    Zhang, Tong
    Cui, Zhen
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 878 - 882
  • [28] Detection of faces and facial landmarks using iconic filter banks
    Takacs, B
    Wechsler, H
    PATTERN RECOGNITION, 1997, 30 (10) : 1623 - 1636
  • [29] Driver Drowsiness Detection Using Vision Transformer
    Usmani, Shaheen
    Chandwani, Bharat
    Sadhya, Debanjan
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 445 - 454
  • [30] Face Mask Detection using Vision Transformer
    Pandya, Bhavik
    Patel, Darshana
    Yow, Kin-Choong
    2023 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE, 2023,