A Region Group Adaptive Attention Model For Subtle Expression Recognition

被引:7
|
作者
Chen, Gan [1 ,2 ]
Peng, Junjie [1 ,3 ]
Zhang, Wenqiang [4 ,5 ]
Huang, Kanrun [6 ]
Cheng, Feng [7 ]
Yuan, Haochen [1 ]
Huang, Yansong [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Univ Sci & Technol, Coll Applicat Sci, Ganzhou 330003, Jiangxi, Peoples R China
[3] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China
[4] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[5] Fudan Univ, Sch Comp Sci & Technol, Shanghai 200433, Peoples R China
[6] Nauto Inc, Palo Alto, CA 94306 USA
[7] Hasso Plattner Inst, D-14482 Postsdam, Germany
关键词
Regions of interest; region group adaptive attention; subtle expression recognition; CONVOLUTIONAL NEURAL-NETWORK; ACTION-UNITS;
D O I
10.1109/TAFFC.2021.3133429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expression recognition has received extensive attention in recent years due to its important applications in many fields. Most expression samples used in research are relatively easy to analyze emotions because they have explicit expressions with strong intensities. However, in situations such as video question and answer, business negotiation, polygraph detection in the security field, autism treatment and medical escort, emotions are expressed in suppressed manners with low intensive expression or subtle expressions, making it difficult to estimate emotions accurately. In these situations, how to effectively extract expression features from facial expression images is a critical problem that affects the accuracy of subtle expression recognition. To address this problem, we propose an end-to-end group adaptive attention model for subtle expression recognition. Cropping an image into several regions of interest (ROI) according to the correlations between facial skeleton and emotions, the proposed model analyses the relationship among regions of interest, and mutual relations between local regions and the holistic region. Using the region group adaptive attention mechanism, the model effectively trains the convolutional neural network to efficiently extract facial expressions representing features and increases the accuracy and robustness of the recognition, particularly in some subtle facial expression circumstances. To improve the ability of different regional features to discriminate expressions, a group adaptive loss function is introduced to verify and improve estimation accuracy. Extensive experiments are conducted on the existing public face datasets CK+, JAFFE, KDEF and the self-collected subtle expression dataset SFER. Results show that the proposed model achieves accuracies of 99.59%, 95.20%, and 93.47% with datasets CK+, JAFFE, and KDEF, respectively. The proposed model thus generally achieves better performance in facial expression recognition than other methods.
引用
收藏
页码:1613 / 1626
页数:14
相关论文
共 50 条
  • [31] ATTENTION MODEL OF INFANT RECOGNITION
    FAGAN, JF
    CHILD DEVELOPMENT, 1977, 48 (02) : 345 - 359
  • [32] Spontaneous subtle expression detection and recognition based on facial strain
    Liong, Sze-Teng
    See, John
    Phan, Raphael C. -W.
    Oh, Yee-Hui
    Le Ngo, Anh Cat
    Wong, KokSheik
    Tan, Su-Wei
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 170 - 182
  • [33] Proposal of Recurrent Attention Module for Capturing Subtle Facial Expression Changes
    Miyoshi, Ryo
    Nagata, Noriko
    Hashimoto, Manabu
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2022, 88 (02): : 168 - 173
  • [34] Facial Expression Recognition with Attention Mechanism
    Wang, Caixia
    Wang, Zhihui
    Cui, Dong
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,
  • [35] Adaptive spatio-temporal attention neural network for crossdatabase micro-expression recognition
    Ran Y.
    ZHENG W.
    ZONG Y.
    LIU J.
    Virtual Reality and Intelligent Hardware, 2023, 5 (02): : 142 - 156
  • [36] A neural network model for pattern recognition based on hypothesis and verification with moving region of attention
    Shimomura, M
    Satoh, S
    Miyake, S
    Aso, H
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 1275 - 1280
  • [37] Efficient DenseNet Model with Fusion of Channel and Spatial Attention for Facial Expression Recognition
    Long, Duong Thang
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2024, 24 (01) : 171 - 189
  • [38] ROI-Attention Vectorized CNN Model for Static Facial Expression Recognition
    Sun, Xiao
    Zheng, Shixin
    Fu, Hongshuai
    IEEE ACCESS, 2020, 8 : 7183 - 7194
  • [39] Emotion-modulated attention improves expression recognition: A deep learning model
    Barros, Pablo
    Parisi, German I.
    Weber, Cornelius
    Wermter, Stefan
    NEUROCOMPUTING, 2017, 253 : 104 - 114
  • [40] An adaptive weighted fusion model with two subspaces for facial expression recognition
    Zhe Sun
    Zheng-ping Hu
    Raymond Chiong
    Meng Wang
    Shuhuan Zhao
    Signal, Image and Video Processing, 2018, 12 : 835 - 843