A hierarchical multi-modal cross-attention model for face anti-spoofing

Cited by: 1

Authors:

Xue, Hao [1]

Ma, Jing [1]

Guo, Xiaoyu [1]

Affiliations:

[1] Nanjing University of Aeronautics and Astronautics, College of Economics and Management, Nanjing 211106, Jiangsu, People's Republic of China

Source:

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2023 / Vol. 97

Funding:

National Natural Science Foundation of China;

Keywords:

Facial recognition; Face anti-spoofing; Multi-modal; Feature fusion; Hierarchical feature extraction; Cross-attention;

DOI:

10.1016/j.jvcir.2023.103969

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Facial recognition has become popular in interactive systems as a means of authenticating identity. However, facial recognition is easily attacked through face spoofing. In this paper, we propose a hierarchical multi-modal cross-attention model for face anti-spoofing, which can be flexibly applied in both single-modal and multi-modal scenarios. To map features among modalities thoroughly, we also design a novel attention mechanism, namely W-MSA-CA (Window-based Multihead Self-Attention and Cross Attention), which leverages both Multi-modal Multihead Self-Attention (MMSA) and Multi-modal Patch Cross Attention (MPCA) to fuse multi-modal features. We test the proposed model on public datasets, and the results show that it effectively detects various types of spoofing.


Pages: 12

Total: 50 entries

[21] A joint hierarchical cross-attention graph convolutional network for multi-modal facial expression recognition
Xu, Chujie
Du, Yong
Wang, Jingzi
Zheng, Wenjie
Li, Tiejun
Yuan, Zhansheng
[J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (01)
[22] Multi-modal Face Anti-spoofing Using Multi-fusion Network and Global Depth-wise Convolution
Zhou, Qian
Yang, Ming
Chen, Shidong
Yan, Hongzheng
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[23] IS CROSS-ATTENTION PREFERABLE TO SELF-ATTENTION FOR MULTI-MODAL EMOTION RECOGNITION?
Rajan, Vandana
Brutti, Alessio
Cavallaro, Andrea
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4693 - 4697
[24] Multiscale Residual Gradient Attention for Face Anti-Spoofing
Zhu, Shiwei
Xiang, Shijun
[J]. SSRN, 2022,
[25] Multiscale residual gradient attention for face anti-spoofing
Zhu, Shiwei
Xiang, Shijun
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
[26] CA-MoEiT: Generalizable Face Anti-spoofing via Dual Cross-Attention and Semi-fixed Mixture-of-Expert
Liu, Ajian
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 5439 - 5452
[27] Multi-modal cross-attention network for Alzheimer's disease diagnosis with multi data
Zhang, Jin
He, Xiaohai
Liu, Yan
Cai, Qingyan
Chen, Honggang
Qing, Linbo
[J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 162
[28] Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection
Ying, Long
Yu, Hui
Wang, Jinguang
Ji, Yongze
Qian, Shengsheng
[J]. IEEE ACCESS, 2021, 9 : 132363 - 132373
[29] Dynamic Attention based Domain Generalization for Face Anti-Spoofing
Zhang, Sheng
Gao, Zhibin
Lin, Yunhao
Lu, Yuhang
Huang, Lianfen
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3413 - 3421
[30] Recognizing Multi-Modal Face Spoofing with Face Recognition Networks
Parkin, Aleksandr
Grinchuk, Oleg
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1617 - 1623
