Transformer embedded spectral-based graph network for facial expression recognition

Cited by: 4
Authors
Jin, Xing [1]
Song, Xulin [2]
Wu, Xiyin [3]
Yan, Wenzhu [4]
Affiliations
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, 159 Longpan Rd, Nanjing 210037, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Internet Things, 9 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
[3] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China
[4] Nanjing Normal Univ, Sch Artificial Intelligence, Sch Comp & Elect Informat, 1 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Graph convolution; Facial expression; Transformer encoder; Action units
DOI
10.1007/s13042-023-02016-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep graph convolution networks that exploit relations among facial muscle movements have achieved great success in facial expression recognition (FER). Because of their limited receptive field, however, existing graph convolution operations struggle to model long-range muscle movement relations, which play a crucial role in FER. To alleviate this issue, we introduce the transformer encoder into graph convolution networks, in which the vision transformer enables all facial muscle movements to interact within a global receptive field and to model more complex relations. Specifically, we construct facial graph data by cropping regions of interest (ROIs) associated with facial action units, and each ROI is represented by the hidden-layer representation of a deep auto-encoder. To effectively extract features from the constructed facial graph data, we propose a novel transformer embedded spectral-based graph convolution network (TESGCN), in which the transformer encoder is exploited to capture complex relations among facial ROIs for FER. Extensive experiments across four facial expression datasets empirically show the superiority of the proposed model over vanilla graph convolution networks. Moreover, the proposed TESGCN has only 80K parameters and a model size of 0.41 MB, and achieves results comparable to those of existing lightweight networks.
Pages: 2063-2077
Page count: 15
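
The abstract's central contrast, a graph convolution that mixes only neighbouring ROIs versus an attention step that lets every ROI interact with every other, can be illustrated with a minimal NumPy sketch. This is not the authors' TESGCN: the chain-shaped graph, the layer sizes, the random weights, the single attention head, and all function names are assumptions chosen for illustration, and the graph convolution uses the common first-order normalized-adjacency form rather than whatever spectral filter the paper defines.

```python
import numpy as np


def normalized_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} with self-loops."""
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def graph_conv(x, a_norm, weight):
    """One graph convolution with ReLU: each node aggregates only direct neighbours."""
    return np.maximum(a_norm @ x @ weight, 0.0)


def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product attention across all nodes."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    attn = softmax(q @ k.T / np.sqrt(k.shape[1]))
    return attn @ v, attn


rng = np.random.default_rng(0)
num_rois, feat_dim, hidden_dim = 6, 8, 4  # illustrative sizes, not from the paper

# Hypothetical chain graph: ROI i is connected only to ROI i + 1.
adj = np.zeros((num_rois, num_rois))
for i in range(num_rois - 1):
    adj[i, i + 1] = adj[i + 1, i] = 1.0

# Random stand-in for the auto-encoder features of each ROI.
x = rng.normal(size=(num_rois, feat_dim))

a_norm = normalized_adjacency(adj)
h = graph_conv(x, a_norm, rng.normal(scale=0.1, size=(feat_dim, hidden_dim)))
w_q, w_k, w_v = (rng.normal(scale=0.1, size=(hidden_dim, hidden_dim)) for _ in range(3))
out, attn = self_attention(h, w_q, w_k, w_v)

# The two ends of the chain share no edge, so one graph convolution cannot mix them.
print(a_norm[0, num_rois - 1])
# Attention assigns a positive weight to every pair, including the two ends.
print(attn[0, num_rois - 1] > 0)
```

In this toy setup the normalized adjacency has a zero entry between the first and last ROI, while the attention matrix gives that pair a nonzero weight, which is the long-range interaction the abstract refers to.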