Masked Face Transformer

Cited by: 0
Authors
Zhao, Weisong [1, 2]
Zhu, Xiangyu [3, 4]
Guo, Kaiwen [3, 4]
Shi, Haichao [1, 2]
Zhang, Xiao-Yu [1, 2]
Lei, Zhen [3, 4, 5]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 101408, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 101408, Peoples R China
[5] Chinese Acad Sci, Hong Kong Inst Sci & Innovat, CAIR, Hong Kong, Peoples R China
Keywords
Face recognition; Transformers; Feature extraction; Training; Task analysis; Costs; COVID-19; Masked face recognition; face recognition; transformer; RECOGNITION; ROBUST;
DOI
10.1109/TIFS.2023.3322600
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The COVID-19 pandemic has made wearing masks mandatory. Existing CNN-based face recognition (FR) systems suffer severe performance degradation because masks occlude vital facial regions. Recently, Vision Transformers have shown promising performance in various vision tasks, but at quadratic computation cost. Swin Transformer introduced a successive window attention mechanism that allows cross-window connections and is more computationally efficient. Despite its potential, deploying Swin Transformer for masked face recognition faces two challenges: 1) the attention range is insufficient to capture locally compatible face regions; 2) masked face recognition can be defined as an occlusion-robust classification task with a known occlusion position, i.e., the position of the mask varies only slightly, a property that has been overlooked yet is effective for improving recognition accuracy. To address these problems, we propose a Masked Face Transformer (MFT) with Masked Face-compatible Attention (MFA). The proposed MFA 1) introduces two additional window partition configurations, namely row shift and column shift, to enlarge the attention range in Swin with unchanged computation cost, and 2) suppresses the interaction between masked and non-masked regions to retain their discrepancies. Additionally, because mask occlusion separates the masked and non-masked samples of the same identity, we propose a ClassFormer module that explores the relationship between them to enhance intra-class aggregation. Extensive experiments show that MFT outperforms state-of-the-art masked face recognition methods on both simulated and real masked face test datasets.
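The two ideas attributed to MFA in the abstract, extra row-shifted and column-shifted window partitions and suppressed interaction between masked and non-masked regions, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the cyclic shift via `np.roll`, the 8 x 8 token grid, the window size of 4, and the choice of row 5 as the mask boundary are all assumptions made for illustration, and the sketch does not handle tokens that wrap around the grid edge.

```python
import numpy as np

def window_partition(grid, ws):
    """Split an (H, W) grid into non-overlapping ws x ws windows.

    Returns an array of shape (num_windows, ws * ws).
    """
    H, W = grid.shape
    grid = grid.reshape(H // ws, ws, W // ws, ws)
    return grid.transpose(0, 2, 1, 3).reshape(-1, ws * ws)

def shifted_partition(grid, ws, row_shift=0, col_shift=0):
    """Cyclically shift the grid along rows and/or columns, then partition.

    Hypothetical stand-in for the 'row shift' and 'column shift'
    configurations; the cyclic wrap-around is an assumption.
    """
    shifted = np.roll(grid, shift=(-row_shift, -col_shift), axis=(0, 1))
    return window_partition(shifted, ws)

def region_attention_mask(region_ids, ws, row_shift=0, col_shift=0):
    """Boolean mask of shape (num_windows, ws*ws, ws*ws).

    True where two tokens in the same window carry the same region label,
    so attention across the masked / non-masked boundary can be blocked.
    """
    win = shifted_partition(region_ids, ws, row_shift, col_shift)
    return win[:, :, None] == win[:, None, :]

H = W = 8          # assumed token grid size
ws = 4             # assumed window size
token_ids = np.arange(H * W).reshape(H, W)

# Assumed region map: rows 5 and below are treated as covered by the mask.
region_ids = np.zeros((H, W), dtype=int)
region_ids[5:, :] = 1

regular = window_partition(token_ids, ws)
row_shifted = shifted_partition(token_ids, ws, row_shift=ws // 2)
col_shifted = shifted_partition(token_ids, ws, col_shift=ws // 2)
allow = region_attention_mask(region_ids, ws)
```

Every configuration yields the same number of windows of the same size, so the per-window attention cost is unchanged while each token sees different neighbours under each shift. In an attention layer, positions where `allow` is `False` would typically receive a large negative bias before the softmax.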
Pages: 265-279
Page count: 15