MMATrans: Muscle Movement Aware Representation Learning for Facial Expression Recognition via Transformers

被引:13
|
作者
Liu, Hai [1 ]
Zhou, Qiyun [1 ]
Zhang, Cheng [1 ]
Zhu, Junyan [1 ]
Liu, Tingting [2 ,3 ]
Zhang, Zhaoli [1 ]
Li, You-Fu [4 ,5 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China
[2] Univ Hong Kong, Fac Educ, Hong Kong, Peoples R China
[3] Hubei Univ, Sch Educ, Wuhan 430062, Peoples R China
[4] City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China
[5] City Univ Hong Kong Shenzhen Res Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Facial muscles; Muscles; Visualization; Transformers; Semantics; Representation learning; Critical minority; facial expression recognition (FER); facial muscle movement; human-robot interaction; semantic relationships; visual transformer;
D O I
10.1109/TII.2024.3431640
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How to automatically recognize facial expression has caused concerns in industrial human-robot interaction. However, facial expression recognition (FER) is susceptible to problems, such as occlusion, arbitrary orientations, and illumination. To effectively address these challenges in FER, we present a novel facial muscle movement aware representation learning that can learn the semantic relationships of facial muscle movements in facial expression images. Two key findings are revealed: 1) muscle movements from different facial regions often show semantic relationships; and 2) not all facial muscle regions have equal contributions for different facial expressions. On this basis, this model presents two novel modules, namely, discriminative feature generation (DFG) and muscle relationship mining (MRM). Specifically, in DFG, the memory of our model for mislabeling decreases. In MRM, muscle-motion interaction among diverse facial regions is learned through visual transformers (MMATrans). Experiments on three in-the-wild FER datasets (RAF-DB, FERPlus, and AffectNet) show that our MMATrans yields better performance compared with state-of-the-art methods.
引用
收藏
页码:13753 / 13764
页数:12
相关论文
共 50 条
  • [31] Adaptive Deep Metric Learning for Identity-Aware Facial Expression Recognition
    Liu, Xiaofeng
    Kumar, B. V. K. Vijaya
    You, Jane
    Jia, Ping
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 522 - 531
  • [32] Hybrid Attention-Aware Learning Network for Facial Expression Recognition in the Wild
    Gong, Weijun
    La, Zhiyao
    Qian, Yurong
    Zhou, Weihang
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (09) : 12203 - 12217
  • [33] Secondary Information Aware Facial Expression Recognition
    Tian, Ye
    Cheng, Jingchun
    Li, Yali
    Wang, Shengjin
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (12) : 1753 - 1757
  • [34] Relation-Aware Facial Expression Recognition
    Xia, Yifan
    Yu, Hui
    Wang, Xiao
    Jian, Muwei
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (03) : 1143 - 1154
  • [35] Robust Facial Expression Recognition via Sparse Representation and Multiple Gabor filters
    El-Sayed, Rania Salah
    El Kholy, Ahmed
    El-Nahas, Mohamed Youssri
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (03) : 82 - 87
  • [36] Facial expression recognition via sparse representation using positive and reverse templates
    Jiang, Xingguo
    Feng, Bin
    Jin, Liangnian
    IET IMAGE PROCESSING, 2016, 10 (08) : 616 - 623
  • [37] Cross-Domain Facial Expression Recognition via Disentangling Identity Representation
    Liu, Tong
    Li, Jing
    Wu, Jia
    Zhang, Lefei
    Zhao, Shanshan
    Chang, Jun
    Wan, Jun
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1213 - 1221
  • [38] Robust facial expression recognition via sparse representation over overcomplete dictionaries
    Xia, Haiying
    Xu, Ruyi
    Song, Shuxiang
    Journal of Computational Information Systems, 2012, 8 (01): : 425 - 433
  • [39] Learning transferable non-negative feature representation for facial expression recognition
    Ji, Liang
    Song, Peng
    Zhang, Wenjing
    Li, Shaokai
    DIGITAL SIGNAL PROCESSING, 2023, 139
  • [40] Bilateral Hemiface Feature Representation Learning for Pose Robust Facial Expression Recognition
    Baddar, Wissam J.
    Ro, Yong Man
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,