Temporal-spatial correlation and graph attention-guided network for micro-expression recognition in English learning livestreams

被引:0
|
作者
Zhao, Hongxin [1 ]
Kim, Byung-Gyu [2 ]
Slowik, Adam [3 ]
Pan, Daohua [4 ]
机构
[1] Tianjin Univ Finance & Econ, Pearl River Coll, Tianjin 301811, Peoples R China
[2] Sookmyung Womens Univ, Dept Artificial Intelligence Engn, Seoul, South Korea
[3] Koszalin Univ Technol, Coll Comp Sci, Koszalin, Poland
[4] Heilongjiang Vocat Coll Nationalities, Dept Informat Management, Harbin 150066, Peoples R China
关键词
Micro-expression recognition; English learning livestreams; Spatio-temporal graph convolution; Transformer;
D O I
10.1007/s10791-024-09477-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Micro-expressions, fleeting facial movements lasting 1/25 to 1/3 of a second, offer crucial insights into genuine emotions, particularly valuable in online education settings. The rapid growth of English learning livestreams has heightened the need for accurate, real-time micro-expression recognition to enhance learner engagement and instructional effectiveness. However, existing methods need help with the subtle nature of these expressions, especially in dynamic, low-resolution streaming environments. This paper presents TSG-MER-ELL, a novel end-to-end network for micro-expression recognition in English learning livestreams, integrating temporal-spatial correlation and graph attention mechanisms. The framework addresses the unique challenges of real-time emotion analysis in online language education, where subtle facial cues are crucial in understanding learner engagement and comprehension. The temporal-spatial correlation module employs action units with spatio-temporal graph convolution to aggregate features from diverse facial regions, while transformer encoders construct long-range correlations. The graph attention module builds upon local facial areas to guide self-attention computations, yielding precise local correlation features. These global and local features are fused for the final micro-expression classification. We introduce an adaptive loss function that balances accuracy, efficiency, and relevance to linguistic context. Extensive experiments on SMIC, CASME II, and SAMM datasets, adapted for English learning scenarios, demonstrate TSG-MER-ELL's superior performance over ten state-of-the-art baselines. The TSG-MER-ELL framework achieves top UF1 and UAR scores across all datasets, significantly improving recognition speed and accuracy. Ablation studies and visualizations of temporal-spatial features and graph attention weights provide insights into the framework's effectiveness in capturing subtle emotional cues. TSG-MER-ELL's robust performance in varied online learning conditions highlights its potential to enhance engagement, personalize instruction, and improve overall outcomes in virtual English language education.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] MCNet: meta-clustering learning network for micro-expression recognition
    Wang, Ziqi
    Fu, Wenwen
    Zhang, Yue
    Li, Jiarui
    Gong, Wenjuan
    Gonzalez, Jordi
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [32] Micro-Expression Recognition Method Based on Spatial Attention Mechanism and Optical Flow Features
    Liu D.
    Liang Z.
    Sun Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (10): : 1541 - 1552
  • [33] TSMGA: Temporal-Spatial Multiscale Graph Attention Network for Remote Sensing Change Detection
    Zhang, Xiaoyang
    Yuan, Genji
    Hua, Zhen
    Li, Jinjiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 3696 - 3712
  • [34] Micro-Expression Recognition Enhanced by Macro-Expression from Spatial-Temporal Domain
    Xia, Bin
    Wang, Shangfei
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1186 - 1193
  • [35] Speech Emotion Recognition Based on Temporal-Spatial Learnable Graph Convolutional Neural Network
    Yan, Jingjie
    Li, Haihua
    Xu, Fengfeng
    Zhou, Xiaoyang
    Liu, Ying
    Yang, Yuan
    ELECTRONICS, 2024, 13 (11)
  • [36] Multi-scale fusion visual attention network for facial micro-expression recognition
    Pan, Hang
    Yang, Hongling
    Xie, Lun
    Wang, Zhiliang
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [37] Spatiotemporal Convolutional Neural Network with Convolutional Block Attention Module for Micro-Expression Recognition
    Chen, Boyu
    Zhang, Zhihao
    Liu, Nian
    Tan, Yang
    Liu, Xinyu
    Chen, Tong
    INFORMATION, 2020, 11 (08)
  • [38] Shallow multi-branch attention convolutional neural network for micro-expression recognition
    Wang, Gang
    Huang, Shucheng
    Tao, Zhe
    MULTIMEDIA SYSTEMS, 2023, 29 (04) : 1967 - 1980
  • [39] Shallow multi-branch attention convolutional neural network for micro-expression recognition
    Gang Wang
    Shucheng Huang
    Zhe Tao
    Multimedia Systems, 2023, 29 : 1967 - 1980
  • [40] Athlete facial micro-expression recognition method based on graph convolutional neural network
    Xu, Haochen
    Zhu, Zhiqiang
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2024, 16 (05) : 478 - 496