Transformer embedded spectral-based graph network for facial expression recognition

Cited by: 4
Authors
Jin, Xing [1]
Song, Xulin [2]
Wu, Xiyin [3]
Yan, Wenzhu [4]
Affiliations
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, 159 Longpan Rd, Nanjing 210037, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Internet Things, 9 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
[3] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China
[4] Nanjing Normal Univ, Sch Artificial Intelligence, Sch Comp & Elect Informat, 1 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Graph convolution; Facial expression; Transformer encoder; Action units;
DOI
10.1007/s13042-023-02016-z
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Deep graph convolution networks that exploit relations among facial muscle movements have achieved great success in facial expression recognition (FER). Because of their limited receptive field, however, existing graph convolution operations struggle to model long-range muscle movement relations, which play a crucial role in FER. To alleviate this issue, we introduce the transformer encoder into graph convolution networks, in which the vision transformer enables all facial muscle movements to interact in a global receptive field and to model more complex relations. Specifically, we construct facial graph data by cropping regions of interest (ROIs) associated with facial action units, and each ROI is represented by the hidden-layer representation of a deep auto-encoder. To effectively extract features from the constructed facial graph data, we propose a novel transformer embedded spectral-based graph convolution network (TESGCN), in which the transformer encoder is exploited to model complex relations among facial ROIs for FER. Extensive experiments across four facial expression datasets empirically show the superiority of the proposed model over vanilla graph convolution networks. Moreover, the proposed TESGCN has only 80K parameters and a 0.41 MB model size, and achieves results comparable to those of existing lightweight networks.
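The contrast the abstract draws between a local graph-convolution receptive field and a global attention receptive field can be illustrated with a minimal NumPy sketch. This is not the authors' TESGCN: the chain-shaped ROI graph, the feature sizes, the single-head attention, and every function and variable name below are assumptions chosen only for illustration, and the graph convolution uses the common first-order normalized-adjacency approximation of a spectral filter.

```python
import numpy as np

def normalized_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}, the first-order spectral GCN propagation matrix."""
    a_hat = adj + np.eye(adj.shape[0])          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def graph_conv(x, adj_norm, weight):
    """One graph convolution layer with ReLU: each node mixes only with its neighbors."""
    return np.maximum(adj_norm @ x @ weight, 0.0)

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)     # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over all nodes."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    attn = softmax(q @ k.T / np.sqrt(k.shape[-1]))
    return attn @ v, attn

rng = np.random.default_rng(0)
num_rois, feat_dim, hidden_dim = 6, 16, 8       # hypothetical sizes

# Hypothetical ROI features (stand-ins for auto-encoder hidden representations).
x = rng.normal(size=(num_rois, feat_dim))

# Hypothetical chain graph: ROI i is connected only to ROI i+1.
adj = np.zeros((num_rois, num_rois))
for i in range(num_rois - 1):
    adj[i, i + 1] = adj[i + 1, i] = 1.0

adj_norm = normalized_adjacency(adj)
w_gcn = rng.normal(scale=0.1, size=(feat_dim, hidden_dim))
w_q = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
w_k = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
w_v = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))

h_local = graph_conv(x, adj_norm, w_gcn)                 # local mixing only
h_global, attn = self_attention(h_local, w_q, w_k, w_v)  # every ROI attends to every ROI
```

In this sketch the propagation matrix has a zero entry between the two ends of the chain, so a single graph convolution cannot pass information between them, whereas the attention matrix assigns a nonzero weight to every pair of ROIs.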
Pages: 2063 - 2077
Page count: 15
Related papers
50 in total
  • [21] Facial Expression Recognition Based On Residual Network
    Jiang, Qiqi
    Peng, Xiwei
    Chen, Hanyu
    Guo, Yujie
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7000 - 7006
  • [22] Facial Expression Recognition Method Embedded with Attention Mechanism Residual Network
    Zhong, Rui
    Jiang, Bin
    Li, Nanxing
    Cui, Xiaomei
    Computer Engineering and Applications, 2023, 59 (11) : 88 - 97
  • [23] Learning the Connectivity: Situational Graph Convolution Network for Facial Expression Recognition
    Zhou, Jinzhao
    Zhang, Xingming
    Liu, Yang
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 230 - 234
  • [24] Facial micro-expression recognition based on motion magnification network and graph attention mechanism
    Wu, Falin
    Xia, Yu
    Hu, Tiangyang
    Ma, Boyi
    Yang, Jingyao
    Li, Haoxin
    HELIYON, 2024, 10 (16)
  • [25] Facial Expression Recognition via Deep Action Units Graph Network Based on Psychological Mechanism
    Liu, Yang
    Zhang, Xingming
    Lin, Yubei
    Wang, Haoxiang
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (02) : 311 - 322
  • [26] Athlete facial micro-expression recognition method based on graph convolutional neural network
    Xu, Haochen
    Zhu, Zhiqiang
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2024, 16 (05) : 478 - 496
  • [27] Expression snippet transformer for robust video-based facial expression recognition
    Liu, Yuanyuan
    Wang, Wenbin
    Feng, Chuanxu
    Zhang, Haoyu
    Chen, Zhe
    Zhan, Yibing
    PATTERN RECOGNITION, 2023, 138
  • [28] Windmill Graph based Feature Descriptors for Facial Expression Recognition
    Kartheek, Mukku Nisanth
    Prasad, V. N. K. Munaga
    Bhukya, Raju
    OPTIK, 2022, 260
  • [29] POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition
    Zheng, Ce
    Mendieta, Matias
    Chen, Chen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3138 - 3147
  • [30] Transformer-Augmented Network With Online Label Correction for Facial Expression Recognition
    Ma, Fuyan
    Sun, Bin
    Li, Shutao
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (02) : 593 - 605