MFDAN: Multi-Level Flow-Driven Attention Network for Micro-Expression Recognition

被引：1

作者：

Cai, Wenhao ^{[1
]}

Zhao, Junli ^{[1
]}

Yi, Ran ^{[2
]}

Yu, Minjing ^{[3
]}

Duan, Fuqing ^{[4
]}

Pan, Zhenkuan ^{[1
]}

Liu, Yong-Jin ^{[5
]}

机构：

[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China

[4] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China

[5] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, BNRist, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 12期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Feature extraction; Optical flow; Deep learning; Data mining; Vectors; Emotion recognition; Circuits and systems; micro-expression recognition; attention mechanism;

D O I：

10.1109/TCSVT.2024.3437481

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Facial expressions are an essential part of human emotional communication, and micro-expressions (MEs), as transient and imperceptible non-verbal signals, can potentially reveal real human emotions. However, subtle motion variations, limited and unbalanced samples make micro-expression recognition (MER) challenging. In this paper, we design a novel dual-branch learning framework of multi-level flow-driven attention for micro-expression recognition (MFDAN), which innovatively integrates optical flow prior to guide the attention learning in the image encoding branch, enabling the model to focus on the most discriminative facial regions for subtle motion patterns. Firstly, we extract optical flow information by an optical flow encoding module. Then, in the image coding module, we construct a Transformer structure containing an optical flow-driven attention mechanism, which can effectively locate the interest region of micro-expressions in the image according to the position information of optical flow to capture more sensitive and fine-grained micro-expressions. By interoperating prior knowledge with data learning, and introducing the Dropkey operation and Focal Loss, our method can handle subtle micro-expression features on small imbalanced datasets. Through extensive experiments on three independent datasets and a composite database, including SMIC-HS, SAMM, and CASME II, robust leave-one-subject-out (LOSO) evaluation results show that our method outperforms state-of-the-art methods especially on the composite database.

引用

页码：12823 / 12836

页数：14

共 50 条

[31] Multi-channel Capsule Network for Micro-expression Recognition with Multiscale Fusion
Xie, Zhihua
Fan, Jiawei
Cheng, Shijia
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76833 - 76850
[32] Multi-level feature fusion capsule network with self-attention for facial expression recognition
Huang, Zhiji
Yu, Songsen
Liang, Jun
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
[33] Micro-Expression Recognition Based on Optical Flow and PCANet
Wang, Shiqi
Guan, Suen
Lin, Hui
Huang, Jianming
Long, Fei
Yao, Junfeng
SENSORS, 2022, 22 (11)
[34] CMNet: Contrastive Magnification Network for Micro-Expression Recognition
Wei, Mengting
Jiang, Xingxun
Zheng, Wenming
Zong, Yuan
Lu, Cheng
Liu, Jiateng
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 119 - 127
[35] MLAN: Multi-Level Attention Network
Qin, Peinuan
Wang, Qinxuan
Zhang, Yue
Wei, Xueyao
Gao, Meiguo
IEEE ACCESS, 2022, 10 : 105437 - 105446
[36] Dual-Branch Cross-Attention Network for Micro-Expression Recognition with Transformer Variants
Xie, Zhihua
Zhao, Chuwei
ELECTRONICS, 2024, 13 (02)
[37] A Convolutional Neural Network for Compound Micro-Expression Recognition
Zhao, Yue
Xu, Jiancheng
SENSORS, 2019, 19 (24)
[38] SDGSA: a lightweight shallow dual-group symmetric attention network for micro-expression recognition
Yu, Zhengyang
Chen, Xiaojuan
Qu, Chang
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 8143 - 8162
[39] Multi-level Feature Fusion Facial Expression Recognition Network
Hu, Qian
Wu, Chengdong
Chi, Jianning
Yu, Xiaosheng
Wang, Huan
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5267 - 5272
[40] Multi-level spatial and semantic enhancement network for expression recognition
Ma, Yingdong
Wang, Xia
Wei, Lihua
APPLIED INTELLIGENCE, 2021, 51 (12) : 8565 - 8578

← 1 2 3 4 5 →