Facial Expression Recognition Based on Squeeze Vision Transformer

被引:12
|
作者
Kim, Sangwon [1 ]
Nam, Jaeyeal [1 ]
Ko, Byoung Chul [1 ]
机构
[1] Keimyung Univ, Dept Comp Engn, Daegu 42601, South Korea
关键词
facial expression recognition; vision transformer; squeeze module; visual token; landmark token;
D O I
10.3390/s22103729
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In recent image classification approaches, a vision transformer (ViT) has shown an excellent performance beyond that of a convolutional neural network. A ViT achieves a high classification for natural images because it properly preserves the global image features. Conversely, a ViT still has many limitations in facial expression recognition (FER), which requires the detection of subtle changes in expression, because it can lose the local features of the image. Therefore, in this paper, we propose Squeeze ViT, a method for reducing the computational complexity by reducing the number of feature dimensions while increasing the FER performance by concurrently combining global and local features. To measure the FER performance of Squeeze ViT, experiments were conducted on lab-controlled FER datasets and a wild FER dataset. Through comparative experiments with previous state-of-the-art approaches, we proved that the proposed method achieves an excellent performance on both types of datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Facial Expression Recognition Based on Vision Transformer with Hybrid Local Attention
    Tian, Yuan
    Zhu, Jingxuan
    Yao, Huang
    Chen, Di
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [2] Vision Transformer With Attentive Pooling for Robust Facial Expression Recognition
    Xue, Fanglei
    Wang, Qiangchang
    Tan, Zichang
    Ma, Zhongsong
    Guo, Guodong
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3244 - 3256
  • [3] VISION TRANSFORMER EQUIPPED WITH NEURAL RESIZER ON FACIAL EXPRESSION RECOGNITION TASK
    Hwang, Hyeonbin
    Kim, Soyeon
    Park, Wei-Jin
    Seo, Jiho
    Ko, Kyungtae
    Yeo, Hyeon
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2614 - 2618
  • [4] Patch attention convolutional vision transformer for facial expression recognition with occlusion
    Liu, Chang
    Hirota, Kaoru
    Dai, Yaping
    [J]. INFORMATION SCIENCES, 2023, 619 : 781 - 794
  • [5] Research on facial recognition of sika deer based on vision transformer
    Gong, He
    Luo, Tianye
    Ni, Lingyun
    Li, Ji
    Guo, Jie
    Liu, Tonghe
    Feng, Ruilong
    Mu, Ye
    Hu, Tianli
    Sun, Yu
    Guo, Ying
    Li, Shijun
    [J]. ECOLOGICAL INFORMATICS, 2023, 78
  • [6] Face-mask-aware Facial Expression Recognition based on Face Parsing and Vision Transformer
    Yang, Bo
    Wu, Jianming
    Ikeda, Kazushi
    Hattori, Gen
    Sugano, Masaru
    Iwasawa, Yusuke
    Matsuo, Yutaka
    [J]. PATTERN RECOGNITION LETTERS, 2022, 164 : 173 - 182
  • [7] Survey of facial expression recognition based on computer vision
    School of Information Engineering, Beijing University of Science and Technology, Beijing 100083, China
    [J]. Jisuanji Gongcheng, 2006, 11 (231-233):
  • [8] AUTOMATIC RECOGNITION OF FACIAL EXPRESSION BASED ON COMPUTER VISION
    Zhu, Shaoping
    [J]. INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2015, 8 (03): : 1464 - 1483
  • [9] Research on Facial Expression Recognition Algorithm Based on Lightweight Transformer
    Jiang, Bin
    Li, Nanxing
    Cui, Xiaomei
    Liu, Weihua
    Yu, Zeqi
    Xie, Yongheng
    [J]. INFORMATION, 2024, 15 (06)
  • [10] PIDViT: Pose-Invariant Distilled Vision Transformer for Facial Expression Recognition in the Wild
    Huang, Yin-Fu
    Tsai, Chia-Hsin
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3281 - 3293