Facial Expression Recognition Based on Squeeze Vision Transformer

被引:12
|
作者
Kim, Sangwon [1 ]
Nam, Jaeyeal [1 ]
Ko, Byoung Chul [1 ]
机构
[1] Keimyung Univ, Dept Comp Engn, Daegu 42601, South Korea
关键词
facial expression recognition; vision transformer; squeeze module; visual token; landmark token;
D O I
10.3390/s22103729
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In recent image classification approaches, a vision transformer (ViT) has shown an excellent performance beyond that of a convolutional neural network. A ViT achieves a high classification for natural images because it properly preserves the global image features. Conversely, a ViT still has many limitations in facial expression recognition (FER), which requires the detection of subtle changes in expression, because it can lose the local features of the image. Therefore, in this paper, we propose Squeeze ViT, a method for reducing the computational complexity by reducing the number of feature dimensions while increasing the FER performance by concurrently combining global and local features. To measure the FER performance of Squeeze ViT, experiments were conducted on lab-controlled FER datasets and a wild FER dataset. Through comparative experiments with previous state-of-the-art approaches, we proved that the proposed method achieves an excellent performance on both types of datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Swin-FER: Swin Transformer for Facial Expression Recognition
    Bie, Mei
    Xu, Huan
    Gao, Yan
    Song, Kai
    Che, Xiangjiu
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [32] Micro Expression Recognition Using Convolution Patch in Vision Transformer
    Indolia, Sakshi
    Nigam, Swati
    Singh, Rajiv
    Singh, Vivek Kumar
    Singh, Manoj Kumar
    [J]. IEEE ACCESS, 2023, 11 : 100495 - 100507
  • [33] WiFE: WiFi and Vision Based Unobtrusive Emotion Recognition via Gesture and Facial Expression
    Gu, Yu
    Zhang, Xiang
    Yan, Huan
    Huang, Jingyang
    Liu, Zhi
    Dong, Mianxiong
    Ren, Fuji
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2567 - 2581
  • [34] Vision Based Facial Expression Recognition Using Eigenfaces and Multi-SVM Classifier
    Maw, Hla Myat
    Thu, Soe Myat
    Mon, Myat Thida
    [J]. Communications in Computer and Information Science, 2020, 1287 : 662 - 673
  • [35] Facial micro-expression recognition using three-stream vision transformer network with sparse sampling and relabeling
    Zhang, He
    Yin, Lu
    Zhang, Hanling
    Wu, Xuesong
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3761 - 3771
  • [36] Facial micro-expression recognition using three-stream vision transformer network with sparse sampling and relabeling
    He Zhang
    Lu Yin
    Hanling Zhang
    Xuesong Wu
    [J]. Signal, Image and Video Processing, 2024, 18 : 3761 - 3771
  • [37] Facial Micro-Expression Recognition Enhanced by Score Fusion and a Hybrid Model from Convolutional LSTM and Vision Transformer
    Zheng, Yufeng
    Blasch, Erik
    [J]. SENSORS, 2023, 23 (12)
  • [38] A human activity recognition method based on Vision Transformer
    Han, Huiyan
    Zeng, Hongwei
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] Facial Expression Recognition Based on Fine-Tuned Channel-Spatial Attention Transformer
    Yao, Huang
    Yang, Xiaomeng
    Chen, Di
    Wang, Zhao
    Tian, Yuan
    [J]. SENSORS, 2023, 23 (15)
  • [40] CL-TransFER: Collaborative learning based transformer for facial expression recognition with masked reconstruction
    Yang, Yujie
    Hu, Lin
    Zu, Chen
    Zhang, Jianjia
    Hou, Yun
    Chen, Ying
    Zhou, Jiliu
    Zhou, Luping
    Wang, Yan
    [J]. PATTERN RECOGNITION, 2024, 156