VaBTFER: An Effective Variant Binary Transformer for Facial Expression Recognition

被引:2
|
作者
Shen, Lei [1 ]
Jin, Xing [1 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing 100190, Peoples R China
关键词
facial expression recognition; spatial-channel feature relevance Transformer; lightweight variant Transformer; binary quantization mechanism; multilayer channel reduction self-attention; dynamic learnable information extraction; NEURAL-NETWORK;
D O I
10.3390/s24010147
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Existing Transformer-based models have achieved impressive success in facial expression recognition (FER) by modeling the long-range relationships among facial muscle movements. However, the size of pure Transformer-based models tends to be in the million-parameter level, which poses a challenge for deploying these models. Moreover, the lack of inductive bias in Transformer usually leads to the difficulty of training from scratch on limited FER datasets. To address these problems, we propose an effective and lightweight variant Transformer for FER called VaTFER. In VaTFER, we firstly construct action unit (AU) tokens by utilizing action unit-based regions and their histogram of oriented gradient (HOG) features. Then, we present a novel spatial-channel feature relevance Transformer (SCFRT) module, which incorporates multilayer channel reduction self-attention (MLCRSA) and a dynamic learnable information extraction (DLIE) mechanism. MLCRSA is utilized to model long-range dependencies among all tokens and decrease the number of parameters. DLIE's goal is to alleviate the lack of inductive bias and improve the learning ability of the model. Furthermore, we use an excitation module to replace the vanilla multilayer perception (MLP) for accurate prediction. To further reduce computing and memory resources, we introduce a binary quantization mechanism, formulating a novel lightweight Transformer model called variant binary Transformer for FER (VaBTFER). We conduct extensive experiments on several commonly used facial expression datasets, and the results attest to the effectiveness of our methods.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Facial expression recognition by considering nonuniform local binary patterns
    Srinivasa Reddy, K.
    Sunil Reddy, E.
    Baswanth, N.
    Advances in Intelligent Systems and Computing, 2019, 906 : 645 - 658
  • [22] Spatiotemporal Local Monogenic Binary Patterns for Facial Expression Recognition
    Huang, Xiaohua
    Zhao, Guoying
    Zheng, Wenming
    Pietikainen, Matti
    IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (05) : 243 - 246
  • [23] Automatic Facial Expression Recognition Using Local Binary Pattern
    Wang, Wencheng
    Chang, Faliang
    Zhao, Jianguo
    Chen, Zhenxue
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 6375 - 6378
  • [24] Facial expression recognition using local binary covariance matrices
    Guo, Song
    Ruan, Qiuqi
    2011 IET 4TH INTERNATIONAL CONFERENCE ON WIRELESS, MOBILE & MULTIMEDIA NETWORKS (ICWMMN 2011), 2011, : 237 - 242
  • [25] Robust facial expression recognition using local binary patterns
    Shan, CF
    Gong, SG
    McOwan, PW
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 2225 - 2228
  • [26] FACIAL EXPRESSION RECOGNITION USING ENHANCED LOCAL BINARY PATTERNS
    Ekweariri, Augustine Nnamdi
    Yurtkan, Kamil
    2017 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2017, : 43 - 47
  • [27] Facial Expression Recognition Using Modified Local Binary Pattern
    Biswas, Suparna
    Sil, Jaya
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 595 - 604
  • [28] Facial Expression Recognition Based on Deep Binary Convolutional Network
    Zhou L.
    Liu J.
    Li W.
    Mi J.
    Lei B.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (03): : 425 - 436
  • [29] Facial expression recognition with local binary pattern and laplacian eigenmaps
    School of Information, Wuyi University, Jiangmen, Guangdong 529020, China
    不详
    Lect. Notes Comput. Sci., (228-235):
  • [30] Facial Expression Recognition Based on Vision Transformer with Hybrid Local Attention
    Tian, Yuan
    Zhu, Jingxuan
    Yao, Huang
    Chen, Di
    APPLIED SCIENCES-BASEL, 2024, 14 (15):