Temporal-spatial correlation and graph attention-guided network for micro-expression recognition in English learning livestreams

被引:0
|
作者
Zhao, Hongxin [1 ]
Kim, Byung-Gyu [2 ]
Slowik, Adam [3 ]
Pan, Daohua [4 ]
机构
[1] Tianjin Univ Finance & Econ, Pearl River Coll, Tianjin 301811, Peoples R China
[2] Sookmyung Womens Univ, Dept Artificial Intelligence Engn, Seoul, South Korea
[3] Koszalin Univ Technol, Coll Comp Sci, Koszalin, Poland
[4] Heilongjiang Vocat Coll Nationalities, Dept Informat Management, Harbin 150066, Peoples R China
关键词
Micro-expression recognition; English learning livestreams; Spatio-temporal graph convolution; Transformer;
D O I
10.1007/s10791-024-09477-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Micro-expressions, fleeting facial movements lasting 1/25 to 1/3 of a second, offer crucial insights into genuine emotions, particularly valuable in online education settings. The rapid growth of English learning livestreams has heightened the need for accurate, real-time micro-expression recognition to enhance learner engagement and instructional effectiveness. However, existing methods need help with the subtle nature of these expressions, especially in dynamic, low-resolution streaming environments. This paper presents TSG-MER-ELL, a novel end-to-end network for micro-expression recognition in English learning livestreams, integrating temporal-spatial correlation and graph attention mechanisms. The framework addresses the unique challenges of real-time emotion analysis in online language education, where subtle facial cues are crucial in understanding learner engagement and comprehension. The temporal-spatial correlation module employs action units with spatio-temporal graph convolution to aggregate features from diverse facial regions, while transformer encoders construct long-range correlations. The graph attention module builds upon local facial areas to guide self-attention computations, yielding precise local correlation features. These global and local features are fused for the final micro-expression classification. We introduce an adaptive loss function that balances accuracy, efficiency, and relevance to linguistic context. Extensive experiments on SMIC, CASME II, and SAMM datasets, adapted for English learning scenarios, demonstrate TSG-MER-ELL's superior performance over ten state-of-the-art baselines. The TSG-MER-ELL framework achieves top UF1 and UAR scores across all datasets, significantly improving recognition speed and accuracy. Ablation studies and visualizations of temporal-spatial features and graph attention weights provide insights into the framework's effectiveness in capturing subtle emotional cues. TSG-MER-ELL's robust performance in varied online learning conditions highlights its potential to enhance engagement, personalize instruction, and improve overall outcomes in virtual English language education.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Micro-expression Recognition Based on Deep Mutual Learning Network
    Xie, Zhihua
    Cheng, Sijia
    Fan, Jiawei
    Huang, Peng
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 751 - 756
  • [22] Micro-Expression Classification based on Landmark Relations with Graph Attention Convolutional Network
    Kumar, Ankith Jain Rakesh
    Bhanu, Bir
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1511 - 1520
  • [23] Micro-Expression Recognition Based on Spatio-Temporal Capsule Network
    Shang, Ziyang
    Liu, Jie
    Li, Xinfu
    IEEE ACCESS, 2023, 11 : 13704 - 13713
  • [24] Attention-guided graph structure learning network for EEG-enabled auditory attention detection
    Zeng, Xianzhang
    Cai, Siqi
    Xie, Longhan
    JOURNAL OF NEURAL ENGINEERING, 2024, 21 (03)
  • [25] Attention-guided spatial-temporal graph relation network for video-based person re-identification
    Qi, Yu
    Ge, Hongwei
    Pei, Wenbin
    Liu, Yuxuan
    Hou, Yaqing
    Sun, Liang
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14227 - 14241
  • [26] ME-PLAN: A deep prototypical learning with local attention network for dynamic micro-expression recognition
    Zhao, Sirui
    Tang, Huaying
    Liu, Shifeng
    Zhang, Yangsong
    Wang, Hao
    Xu, Tong
    Chen, Enhong
    Guan, Cuntai
    NEURAL NETWORKS, 2022, 153 : 427 - 443
  • [27] Micro-expression recognition algorithm based on graph convolutional network and Transformer model
    Wu J.
    Pang W.
    Wang L.
    Zhao B.
    High Technology Letters, 2023, 29 (02) : 213 - 222
  • [28] STA-GCN: Spatio-Temporal AU Graph Convolution Network for Facial Micro-expression Recognition
    Zhao, Xinhui
    Ma, Huimin
    Wang, Rongquan
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 80 - 91
  • [29] Micro-expression recognition algorithm based on graph convolutional network and Transformer model
    吴进
    PANG Wenting
    WANG Lei
    ZHAO Bo
    HighTechnologyLetters, 2023, 29 (02) : 213 - 222
  • [30] RETRACTED: LCBP-STGCN: A local cube binary pattern spatial temporal graph convolutional network for micro-expression recognition (Retracted Article)
    Wang, Yan
    Han, Jianfeng
    Guo, Ziqi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 1601 - 1611