A Novel Transformer-Based Approach for Adult's Facial Emotion Recognition

被引:0
|
作者
Nawaz, Uzma [1 ]
Saeed, Zubair [2 ,3 ]
Atif, Kamran [4 ]
机构
[1] Natl Univ Sci & Technol, Coll Elect & Mech Engn, Knowledge & Data Sci Res Ctr, Dept Comp & Software Engn, Islamabad 44000, Pakistan
[2] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77840 USA
[3] Texas A&M Univ Qatar, Dept Elect & Comp Engn, Doha, Qatar
[4] Deakin Univ, Dept Civil Engn, Melbourne, Vic 3125, Australia
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Emotion recognition; Transformers; Face recognition; Accuracy; Brain modeling; Real-time systems; Adaptation models; Lighting; Human computer interaction; Facial features; Facial emotion recognition; transformers; deep learning; FER2013; CK plus; AffectNet; AFEW; RAF-DB; emotion recognition; EXPRESSION RECOGNITION;
D O I
10.1109/ACCESS.2025.3555510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adult facial expression recognition (FER) is essential for human-computer interaction, mental health assessment, and social robotics applications because it improves user experiences and emotional well-being. This study presents a novel attention mechanism-based transformer approach designed to capture detailed patterns in facial features and dynamically focus on the most relevant regions for enhanced accuracy. Unlike conventional deep learning approaches, our method integrates an adaptive attention mechanism and dynamic token pruning, which optimizes computational efficiency while maintaining high accuracy. The model is evaluated on five widely used datasets: FER2013, CK+, AffectNet, RAF-DB, and AFEW. It achieves state-of-the-art performance, with accuracies of 98.67% on FER2013, 99.52% on CK+, 99.3% on AffectNet, 96.3% on AFEW, and 98.45% on RAF-DB. An ablation study further validates the contribution of each model component, and comparisons with CNN-based and transformer-based approaches confirm the effectiveness of the model. These findings establish the proposed method as a significant advancement in FER, which offers a scalable and efficient solution for real-world applications.
引用
收藏
页码:56485 / 56508
页数:24
相关论文
共 50 条
  • [21] TDFNet: Transformer-Based Deep-Scale Fusion Network for Multimodal Emotion Recognition
    Zhao, Zhengdao
    Wang, Yuhua
    Shen, Guang
    Xu, Yuezhu
    Zhang, Jiayuan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3771 - 3782
  • [22] Transformer-Based Self-Supervised Multimodal Representation Learning for Wearable Emotion Recognition
    Wu, Yujin
    Daoudi, Mohamed
    Amad, Ali
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (01) : 157 - 172
  • [23] Transformer-Based Multilingual Speech Emotion Recognition Using Data Augmentation and Feature Fusion
    Al-onazi, Badriyya B.
    Nauman, Muhammad Asif
    Jahangir, Rashid
    Malik, Muhmmad Mohsin
    Alkhammash, Eman H.
    Elshewey, Ahmed M.
    APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [24] A transformer-based network for speech recognition
    Tang L.
    International Journal of Speech Technology, 2023, 26 (02) : 531 - 539
  • [25] SketchFormer: transformer-based approach for sketch recognition using vector images
    Anil Singh Parihar
    Gaurav Jain
    Shivang Chopra
    Suransh Chopra
    Multimedia Tools and Applications, 2021, 80 : 9075 - 9091
  • [26] Transformer-based approach for printing quality recognition in fused filament fabrication
    Xing Quan Wang
    Zeqing Jin
    Bowen Zheng
    Grace X. Gu
    npj Advanced Manufacturing, 2 (1):
  • [27] Multi-Label Multimodal Emotion Recognition With Transformer-Based Fusion and Emotion-Level Representation Learning
    Le, Hoai-Duy
    Lee, Guee-Sang
    Kim, Soo-Hyung
    Kim, Seungwon
    Yang, Hyung-Jeong
    IEEE ACCESS, 2023, 11 : 14742 - 14751
  • [28] SketchFormer: transformer-based approach for sketch recognition using vector images
    Parihar, Anil Singh
    Jain, Gaurav
    Chopra, Shivang
    Chopra, Suransh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 9075 - 9091
  • [29] A Fuzzy Approach for Facial Emotion Recognition
    Farahani, Fatemeh Shahrabi
    Sheikhan, Mansour
    Farrokhi, Ali
    2013 13TH IRANIAN CONFERENCE ON FUZZY SYSTEMS (IFSC), 2013,
  • [30] TNTC: TWO-STREAM NETWORK WITH TRANSFORMER-BASED COMPLEMENTARITY FOR GAIT-BASED EMOTION RECOGNITION
    Hu, Chuanfei
    Sheng, Weijie
    Dong, Bo
    Li, Xinde
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3229 - 3233