Facial expression recognition based on attention mechanism ResNet lightweight network

被引：0

作者：

Zhao Xiao ^{[1
]}

Yang Chen ^{[1
]}

Wang Ruo-nan ^{[1
]}

Li Yue-chen ^{[1
]}

机构：

[1] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710021, Peoples R China

来源：

CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS | 2023年 / 38卷 / 11期

基金：

中国国家自然科学基金;

关键词：

lightweight resnet network; multi-scale spatial feature fusion; facial expression recognition; attention mechanism;

D O I：

10.37188/CJLCD.2023-0046

中图分类号：

O7 [晶体学];

学科分类号：

0702 ; 070205 ; 0703 ; 080501 ;

摘要：

Aiming at the problems of large network model and low accuracy of ResNet18 network model in facial expression recognition,a Lightweight ResNet based on multi-scale CBAM(Convolutional Block Attention Module) attention mechanism (MCLResNet) is proposed,which can realize facial expression recognition with less parameters and higher accuracy. Firstly,ResNet18 is used as the backbone network to extract features,and group convolution is introduced to reduce the parameters quantity of ResNet18. The inverted residual structure is used to increase the network depth and optimized the effect of image feature extraction. Secondly,the shared fully connected layer in the channel attention module of CBAM is replaced with a 1x3 convolution module,which effectively reduces the loss of channel information. The multi-scale convolution module is added to the CBAM spatial attention module to obtain spatial feature information at different scales. Finally,multi-scale CBAM module(MSCBAM)is added to the lightweight ResNet model,which effectively increases the feature expression ability of the network model. In addition, a fully connected layer is added to the output layer of the network model introduced into MSCBAM,so as to increase the nonlinear representation of the model at the output. The experimental results of the model on FER2013dataset and CK+ dataset show that the parameters quantity of the model proposed in this paper is reduced by 82. 58% compared with ResNet18,and the recognition accuracy is better.

引用

页码：1503 / 1510

页数：8

共 21 条

[1] Facial Expression Recognition By Using a Disentangled Identity-Invariant Expression Representation
Ali, Kamran
Hughes, Charles E.
[J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9460 - 9467
[2] Expression recognition based on residual rectifier enhanced convolution neural network
Chen Bin
Zhu Jin-ning
Dong Yi-zhou
[J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2020, 35 (12) : 1299 - 1308
[3] CHEN J H, 2020, Computer Knowledge and Technology, V16, P187
[4] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[5] Adversarially Adaptive Normalization for Single Domain Generalization
Fan, Xinjie
Wang, Qifei
Ke, Junjie
Yang, Feng
Gong, Boqing
Zhou, Mingyuan
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8204 - 8213
[6] [付国栋 Fu Guodong], 2021, [计算机工程与应用, Computer Engineering and Application], V57, P150
[7] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[8] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[9] Kanade T., 2010, The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression, P94, DOI 10.1109/CVPRW.2010.5543262
[10] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90

← 1 2 3 →