Facial expression recognition based on attention mechanism ResNet lightweight network

被引:0
|
作者
Zhao Xiao [1 ]
Yang Chen [1 ]
Wang Ruo-nan [1 ]
Li Yue-chen [1 ]
机构
[1] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710021, Peoples R China
基金
中国国家自然科学基金;
关键词
lightweight resnet network; multi-scale spatial feature fusion; facial expression recognition; attention mechanism;
D O I
10.37188/CJLCD.2023-0046
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
Aiming at the problems of large network model and low accuracy of ResNet18 network model in facial expression recognition,a Lightweight ResNet based on multi-scale CBAM(Convolutional Block Attention Module) attention mechanism (MCLResNet) is proposed,which can realize facial expression recognition with less parameters and higher accuracy. Firstly,ResNet18 is used as the backbone network to extract features,and group convolution is introduced to reduce the parameters quantity of ResNet18. The inverted residual structure is used to increase the network depth and optimized the effect of image feature extraction. Secondly,the shared fully connected layer in the channel attention module of CBAM is replaced with a 1x3 convolution module,which effectively reduces the loss of channel information. The multi-scale convolution module is added to the CBAM spatial attention module to obtain spatial feature information at different scales. Finally,multi-scale CBAM module(MSCBAM)is added to the lightweight ResNet model,which effectively increases the feature expression ability of the network model. In addition, a fully connected layer is added to the output layer of the network model introduced into MSCBAM,so as to increase the nonlinear representation of the model at the output. The experimental results of the model on FER2013dataset and CK+ dataset show that the parameters quantity of the model proposed in this paper is reduced by 82. 58% compared with ResNet18,and the recognition accuracy is better.
引用
收藏
页码:1503 / 1510
页数:8
相关论文
共 21 条
  • [1] Facial Expression Recognition By Using a Disentangled Identity-Invariant Expression Representation
    Ali, Kamran
    Hughes, Charles E.
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9460 - 9467
  • [2] Expression recognition based on residual rectifier enhanced convolution neural network
    Chen Bin
    Zhu Jin-ning
    Dong Yi-zhou
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2020, 35 (12) : 1299 - 1308
  • [3] CHEN J H, 2020, Computer Knowledge and Technology, V16, P187
  • [4] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [5] Adversarially Adaptive Normalization for Single Domain Generalization
    Fan, Xinjie
    Wang, Qifei
    Ke, Junjie
    Yang, Feng
    Gong, Boqing
    Zhou, Mingyuan
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8204 - 8213
  • [6] [付国栋 Fu Guodong], 2021, [计算机工程与应用, Computer Engineering and Application], V57, P150
  • [7] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Kanade T., 2010, The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression, P94, DOI 10.1109/CVPRW.2010.5543262
  • [10] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90