A hierarchical multi-modal cross-attention model for face anti-spoofing

被引:1
|
作者
Xue, Hao [1 ]
Ma, Jing [1 ]
Guo, Xiaoyu [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing 211106, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Facial recognition; Face anti-spoofing; Multi-modal; Feature fusion; Hierarchical feature extraction; Cross-attention;
D O I
10.1016/j.jvcir.2023.103969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Facial recognition has become popular in interactive systems as a means to authenticate identity. However, Facial recognition can be easily attacked illegally through face spoofing. In this paper, we propose a hierarchical multi-modal cross-attention model for face anti-spoofing, which can be flexibly applied in both single-modal and multi-modal scenarios. In order to map features among modalities thoroughly, we also design a novel attention mechanism, namely W-MSA-CA (Window-based Multihead Self-Attention and Cross Attention), which leverages both Multi-modal Multihead Self-Attention (MMSA) and Multi-modal Patch Cross attention (MPCA) to fuse multi-modal features. We test the proposed model on the public datasets and the results show that our model's capability to detect various types of spoofing is effective.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A joint hierarchical cross-attention graph convolutional network for multi-modal facial expression recognition
    Xu, Chujie
    Du, Yong
    Wang, Jingzi
    Zheng, Wenjie
    Li, Tiejun
    Yuan, Zhansheng
    [J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (01)
  • [22] Multi-modal Face Anti-spoofing Using Multi-fusion Network and Global Depth-wise Convolution
    Zhou, Qian
    Yang, Ming
    Chen, Shidong
    Yan, Hongzheng
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [23] IS CROSS-ATTENTION PREFERABLE TO SELF-ATTENTION FOR MULTI-MODAL EMOTION RECOGNITION?
    Rajan, Vandana
    Brutti, Alessio
    Cavallaro, Andrea
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4693 - 4697
  • [24] Multiscale Residual Gradient Attention for Face Anti-Spoofing
    Zhu, Shiwei
    Xiang, Shijun
    [J]. SSRN, 2022,
  • [25] Multiscale residual gradient attention for face anti-spoofing✩
    Zhu, Shiwei
    Xiang, Shijun
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [27] Multi-modal cross-attention network for Alzheimer's disease diagnosis with multi data
    Zhang, Jin
    He, Xiaohai
    Liu, Yan
    Cai, Qingyan
    Chen, Honggang
    Qing, Linbo
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 162
  • [28] Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection
    Ying, Long
    Yu, Hui
    Wang, Jinguang
    Ji, Yongze
    Qian, Shengsheng
    [J]. IEEE ACCESS, 2021, 9 : 132363 - 132373
  • [29] Dynamic Attention based Domain Generalization for Face Anti-Spoofing
    Zhang, Sheng
    Gao, Zhibin
    Lin, Yunhao
    Lu, Yuhang
    Huang, Lianfen
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3413 - 3421
  • [30] Recognizing Multi-Modal Face Spoofing with Face Recognition Networks
    Parkin, Aleksandr
    Grinchuk, Oleg
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1617 - 1623