Transformer embedded spectral-based graph network for facial expression recognition

Cited by: 4

Authors:

Jin, Xing [1]

Song, Xulin [2]

Wu, Xiyin [3]

Yan, Wenzhu [4]

Affiliations:

[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, 159 Longpan Rd, Nanjing 210037, Jiangsu, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Sch Internet Things, 9 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China

[3] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China

[4] Nanjing Normal Univ, Sch Artificial Intelligence, Sch Comp & Elect Informat, 1 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China

Source:

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2024 / Vol. 15 / Issue 06

Funding:

National Natural Science Foundation of China;

Keywords:

Graph convolution; Facial expression; Transformer encoder; Action units;

DOI:

10.1007/s13042-023-02016-z

Chinese Library Classification (CLC) number:

TP18 [Theory of artificial intelligence];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep graph convolution networks, which exploit relations among facial muscle movements for facial expression recognition (FER), have achieved great success. Because of their limited receptive field, existing graph convolution operations struggle to model long-range muscle movement relations, which play a crucial role in FER. To alleviate this issue, we introduce the transformer encoder into graph convolution networks, in which the vision transformer enables all facial muscle movements to interact in a global receptive field and to model more complex relations. Specifically, we construct facial graph data by cropping regions of interest (ROIs) that are associated with facial action units, and each ROI is represented by the hidden-layer representation of a deep auto-encoder. To effectively extract features from the constructed facial graph data, we propose a novel transformer embedded spectral-based graph convolution network (TESGCN), in which the transformer encoder is exploited to model complex relations among facial ROIs for FER. We empirically show the superiority of the proposed model over vanilla graph convolution networks through extensive experiments across four facial expression datasets. Moreover, the proposed TESGCN has only 80K parameters and a 0.41MB model size, and achieves results comparable to those of existing lightweight networks.
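
The contrast the abstract draws between a local graph convolution and a global attention step can be illustrated with a small NumPy sketch. This is not the authors' TESGCN implementation: the first-order GCN-style normalization, the single attention head, the chain-shaped graph, the random features standing in for auto-encoder ROI representations, and all function and variable names are assumptions made for illustration only.

```python
import numpy as np

def normalized_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} (assumed GCN-style form)."""
    a_hat = adj + np.eye(adj.shape[0])          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def graph_conv(x, adj_norm, weight):
    """One graph convolution with ReLU: each node mixes only with its neighbours."""
    return np.maximum(adj_norm @ x @ weight, 0.0)

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)     # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over all nodes."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    attn = softmax(q @ k.T / np.sqrt(k.shape[1]))
    return attn @ v, attn

rng = np.random.default_rng(0)
num_rois, feat_dim, hidden_dim = 6, 8, 4        # hypothetical sizes

# Random features standing in for the auto-encoder representation of each ROI.
x = rng.normal(size=(num_rois, feat_dim))

# Hypothetical chain graph: ROI i is connected only to ROI i+1.
adj = np.zeros((num_rois, num_rois))
for i in range(num_rois - 1):
    adj[i, i + 1] = adj[i + 1, i] = 1.0

adj_norm = normalized_adjacency(adj)
h = graph_conv(x, adj_norm, 0.1 * rng.normal(size=(feat_dim, hidden_dim)))
out, attn = self_attention(
    h, *(0.1 * rng.normal(size=(hidden_dim, hidden_dim)) for _ in range(3))
)
```

In this toy setup the normalized adjacency has a zero entry between the first and last ROI, so one graph convolution cannot relate them directly, whereas the attention matrix assigns every pair of ROIs a nonzero weight. That is the long-range interaction the abstract attributes to the transformer encoder.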

Pages: 2063 - 2077

Number of pages: 15

50 items in total

[21] Facial Expression Recognition Based On Residual Network
Jiang, Qiqi
Peng, Xiwei
Chen, Hanyu
Guo, Yujie
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7000 - 7006
[22] Facial Expression Recognition Method Embedded with Attention Mechanism Residual Network
Zhong, Rui
Jiang, Bin
Li, Nanxing
Cui, Xiaomei
Computer Engineering and Applications, 2023, 59 (11) : 88 - 97
[23] Learning the Connectivity: Situational Graph Convolution Network for Facial Expression Recognition
Zhou, Jinzhao
Zhang, Xingming
Liu, Yang
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 230 - 234
[24] Facial micro-expression recognition based on motion magnification network and graph attention mechanism
Wu, Falin
Xia, Yu
Hu, Tiangyang
Ma, Boyi
Yang, Jingyao
Li, Haoxin
HELIYON, 2024, 10 (16)
[25] Facial Expression Recognition via Deep Action Units Graph Network Based on Psychological Mechanism
Liu, Yang
Zhang, Xingming
Lin, Yubei
Wang, Haoxiang
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (02) : 311 - 322
[26] Athlete facial micro-expression recognition method based on graph convolutional neural network
Xu, Haochen
Zhu, Zhiqiang
INTERNATIONAL JOURNAL OF BIOMETRICS, 2024, 16 (05) : 478 - 496
[27] Expression snippet transformer for robust video-based facial expression recognition
Liu, Yuanyuan
Wang, Wenbin
Feng, Chuanxu
Zhang, Haoyu
Chen, Zhe
Zhan, Yibing
PATTERN RECOGNITION, 2023, 138
[28] Windmill Graph based Feature Descriptors for Facial Expression Recognition
Kartheek, Mukku Nisanth
Prasad, V. N. K. Munaga
Bhukya, Raju
OPTIK, 2022, 260
[29] POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition
Zheng, Ce
Mendieta, Matias
Chen, Chen
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3138 - 3147
[30] Transformer-Augmented Network With Online Label Correction for Facial Expression Recognition
Ma, Fuyan
Sun, Bin
Li, Shutao
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (02) : 593 - 605
