MFDAN: Multi-Level Flow-Driven Attention Network for Micro-Expression Recognition

被引:1
|
作者
Cai, Wenhao [1 ]
Zhao, Junli [1 ]
Yi, Ran [2 ]
Yu, Minjing [3 ]
Duan, Fuqing [4 ]
Pan, Zhenkuan [1 ]
Liu, Yong-Jin [5 ]
机构
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[4] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
[5] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, BNRist, Beijing 100084, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Feature extraction; Optical flow; Deep learning; Data mining; Vectors; Emotion recognition; Circuits and systems; micro-expression recognition; attention mechanism;
D O I
10.1109/TCSVT.2024.3437481
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Facial expressions are an essential part of human emotional communication, and micro-expressions (MEs), as transient and imperceptible non-verbal signals, can potentially reveal real human emotions. However, subtle motion variations, limited and unbalanced samples make micro-expression recognition (MER) challenging. In this paper, we design a novel dual-branch learning framework of multi-level flow-driven attention for micro-expression recognition (MFDAN), which innovatively integrates optical flow prior to guide the attention learning in the image encoding branch, enabling the model to focus on the most discriminative facial regions for subtle motion patterns. Firstly, we extract optical flow information by an optical flow encoding module. Then, in the image coding module, we construct a Transformer structure containing an optical flow-driven attention mechanism, which can effectively locate the interest region of micro-expressions in the image according to the position information of optical flow to capture more sensitive and fine-grained micro-expressions. By interoperating prior knowledge with data learning, and introducing the Dropkey operation and Focal Loss, our method can handle subtle micro-expression features on small imbalanced datasets. Through extensive experiments on three independent datasets and a composite database, including SMIC-HS, SAMM, and CASME II, robust leave-one-subject-out (LOSO) evaluation results show that our method outperforms state-of-the-art methods especially on the composite database.
引用
收藏
页码:12823 / 12836
页数:14
相关论文
共 50 条
  • [31] Multi-channel Capsule Network for Micro-expression Recognition with Multiscale Fusion
    Xie, Zhihua
    Fan, Jiawei
    Cheng, Shijia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76833 - 76850
  • [32] Multi-level feature fusion capsule network with self-attention for facial expression recognition
    Huang, Zhiji
    Yu, Songsen
    Liang, Jun
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
  • [33] Micro-Expression Recognition Based on Optical Flow and PCANet
    Wang, Shiqi
    Guan, Suen
    Lin, Hui
    Huang, Jianming
    Long, Fei
    Yao, Junfeng
    SENSORS, 2022, 22 (11)
  • [34] CMNet: Contrastive Magnification Network for Micro-Expression Recognition
    Wei, Mengting
    Jiang, Xingxun
    Zheng, Wenming
    Zong, Yuan
    Lu, Cheng
    Liu, Jiateng
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 119 - 127
  • [35] MLAN: Multi-Level Attention Network
    Qin, Peinuan
    Wang, Qinxuan
    Zhang, Yue
    Wei, Xueyao
    Gao, Meiguo
    IEEE ACCESS, 2022, 10 : 105437 - 105446
  • [36] Dual-Branch Cross-Attention Network for Micro-Expression Recognition with Transformer Variants
    Xie, Zhihua
    Zhao, Chuwei
    ELECTRONICS, 2024, 13 (02)
  • [37] A Convolutional Neural Network for Compound Micro-Expression Recognition
    Zhao, Yue
    Xu, Jiancheng
    SENSORS, 2019, 19 (24)
  • [38] SDGSA: a lightweight shallow dual-group symmetric attention network for micro-expression recognition
    Yu, Zhengyang
    Chen, Xiaojuan
    Qu, Chang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 8143 - 8162
  • [39] Multi-level Feature Fusion Facial Expression Recognition Network
    Hu, Qian
    Wu, Chengdong
    Chi, Jianning
    Yu, Xiaosheng
    Wang, Huan
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5267 - 5272
  • [40] Multi-level spatial and semantic enhancement network for expression recognition
    Ma, Yingdong
    Wang, Xia
    Wei, Lihua
    APPLIED INTELLIGENCE, 2021, 51 (12) : 8565 - 8578