UA-FER: Uncertainty-aware representation learning for facial expression recognition

被引:1
|
作者
Zhou, Haoliang [1 ]
Huang, Shucheng [1 ]
Xu, Yuqiao [2 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Peoples R China
[2] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
Facial expression recognition; Uncertainty-aware representation learning; Evidential deep learning; Vision-language pre-training model; Knowledge distillation; FEATURES;
D O I
10.1016/j.neucom.2024.129261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial Expression Recognition (FER) remains a challenging task due to unconstrained conditions like variations in illumination, pose, and occlusion. Current FER approaches mainly focus on learning discriminative features through local attention and global perception of visual encoders, while neglecting the rich semantic information in the text modality. Additionally, these methods rely solely on the softmax-based activation layer for predictions, resulting in overconfident decision-making that hampers the effective handling of uncertain samples and relationships. Such insufficient representations and overconfident predictions degrade recognition performance, particularly in unconstrained scenarios. To tackle these issues, we propose an end-to-end FER framework called UA-FER, which integrates vision-language pre-training (VLP) models with evidential deep learning (EDL) theory to enhance recognition accuracy and robustness. Specifically, to identify multi-grained discriminative regions, we propose the Multi-granularity Feature Decoupling (MFD) module, which decouples global and local facial representations based on image-text affinity while distilling the universal knowledge from the pre-trained VLP models. Additionally, to mitigate misjudgments in uncertain visual-textual relationships, we introduce the Relation Uncertainty Calibration (RUC) module, which corrects these uncertainties using EDL theory. In this way, the model enhances its ability to capture emotion-related discriminative representations and tackle uncertain relationships, thereby improving overall recognition accuracy and robustness. Extensive experiments on in-the-wild and in-the-lab datasets demonstrate that our UA-FER outperforms the state-of-the-art models.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Uncertainty-aware Label Distribution Learning for Facial Expression Recognition
    Le, Nhat
    Nguyen, Khanh
    Tran, Quang
    Tjiputra, Erman
    Le, Bac
    Nguyen, Anh
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6077 - 6086
  • [2] Uncertainty-Aware and Class-Balanced Facial Expression Recognition
    Hong J.
    Tian M.
    Huang Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (10): : 1532 - 1540
  • [3] Uncertainty-Aware Representation Learning for Action Segmentation
    Chen, Lei
    Li, Muheng
    Duan, Yueqi
    Zhou, Jie
    Lu, Jiwen
    PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 820 - 826
  • [4] Patch-Aware Representation Learning for Facial Expression Recognition
    Wu, Yi
    Wang, Shangfei
    Chang, Yanan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6143 - 6151
  • [5] DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition
    Li, Ming
    Fu, Huazhu
    He, Shengfeng
    Fan, Hehe
    Liu, Jun
    Keppo, Jussi
    Shou, Mike Zheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6297 - 6309
  • [6] Uncertainty-Aware Multi-View Representation Learning
    Geng, Yu
    Han, Zongbo
    Zhang, Changqing
    Hu, Qinghua
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7545 - 7553
  • [7] Uncertainty-aware Cross-dataset Facial Expression Recognition via Regularized Conditional Alignment
    Zhou, Linyi
    Fan, Xijian
    Ma, Yingjie
    Tjahjadi, Tardi
    Ye, Qiaolin
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2964 - 2972
  • [8] Uncertainty-Aware Heterogeneous Representation Learning in POI Recommender Systems
    Zhou, Fan
    Qian, Tangjiang
    Mo, Yuhua
    Cheng, Zhangtao
    Xiao, Chunjing
    Wu, Jin
    Trajcevski, Goce
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4522 - 4535
  • [9] UCoL: Unsupervised Learning of Discriminative Facial Representations via Uncertainty-Aware Contrast
    Wang, Hao
    Li, Min
    Song, Yangyang
    Zhang, Youjian
    Chi, Liying
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2510 - 2518
  • [10] MMATrans: Muscle Movement Aware Representation Learning for Facial Expression Recognition via Transformers
    Liu, Hai
    Zhou, Qiyun
    Zhang, Cheng
    Zhu, Junyan
    Liu, Tingting
    Zhang, Zhaoli
    Li, You-Fu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (12) : 13753 - 13764