Presentation attack detection based on two-stream vision transformers with self-attention fusion

被引:7
|
作者
Peng, Fei [1 ]
Meng, Shao-hua [1 ]
Long, Min [2 ]
机构
[1] Hunan Univ, Sch Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
基金
中国国家自然科学基金;
关键词
Presentation attack detection; Multi-scale retinex with color restoration; Vision transformer; Deep learning; Feature fusion; FACE; RETINEX;
D O I
10.1016/j.jvcir.2022.103518
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the performance degradation of the existing presentation attack detection methods due to the illumination variation, a two-stream vision transformers framework (TSViT) based on transfer learning in two complementary spaces is proposed in this paper. The face images of RGB color space and multi-scale retinex with color restoration (MSRCR) space are fed to TSViT to learn the distinguishing features of presentation attack detection. To effectively fuse features from two sources (RGB color space images and MSRCR images), a feature fusion method based on self-attention is built, which can effectively capture the complementarity of two features. Experiments and analysis on Oulu-NPU , CASIA-MFSD , and Replay-Attack databases show that it outperforms most existing methods in intra-database testing and achieves good generalization performance in cross-database testing.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Remote Sensing Image Fusion Algorithm Based on Two-Stream Fusion Network and Residual Channel Attention Mechanism
    Huang, Mengxing
    Liu, Shi
    Li, Zhenfeng
    Feng, Siling
    Wu, Di
    Wu, Yuanyuan
    Shu, Feng
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [22] Pornographic Video Detection with Convolutional Two-Stream Network Fusion
    Lee, Wonjae
    Kim, Junghak
    Lee, Nam Kyung
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1273 - 1275
  • [23] Pedestrian Detection Based on Two-Stream UDN
    Wang, Wentong
    Wang, Lichun
    Ge, Xufei
    Li, Jinghua
    Yin, Baocai
    APPLIED SCIENCES-BASEL, 2020, 10 (05):
  • [24] Two-stream Convolutional Self-attention Model for Leakage Aperture Equivalent Prediction of High-pressure Gas Pipeline
    Dong, Hongyang
    Liu, Tao
    Jiang, Dong
    Li, Huadong
    Zhao, Dongmei
    PROCEEDINGS OF THE 2024 3RD INTERNATIONAL SYMPOSIUM ON INTELLIGENT UNMANNED SYSTEMS AND ARTIFICIAL INTELLIGENCE, SIUSAI 2024, 2024, : 231 - 238
  • [25] Remote Sensing Image Fusion Based on Two-Stream Fusion Network
    Liu, Xiangyu
    Wang, Yunhong
    Liu, Qingjie
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 428 - 439
  • [26] Remote sensing image fusion based on two-stream fusion network
    Liu, Xiangyu
    Liu, Qingjie
    Wang, Yunhong
    INFORMATION FUSION, 2020, 55 : 1 - 15
  • [27] Vision Transformer Based on Reconfigurable Gaussian Self-attention
    Zhao L.
    Zhou J.-K.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (09): : 1976 - 1988
  • [28] A SYN Flood Attack Detection Method Based on Hierarchical Multihead Self-Attention Mechanism
    Guo, Xiaojun
    Gao, Xuan
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [29] Multi-Level Two-Stream Fusion-Based Spatio-Temporal Attention Model for Violence Detection and Localization
    Asad, Mujtaba
    Jiang, He
    Yang, Jie
    Tu, Enmei
    Malik, Aftab A.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [30] Regularizing self-attention on vision transformers with 2D spatial distance loss
    Luiz H. Mormille
    Clifford Broni-Bediako
    Masayasu Atsumi
    Artificial Life and Robotics, 2022, 27 : 586 - 593