Masked Face Transformer

Authors
Zhao, Weisong [1, 2]
Zhu, Xiangyu [3, 4]
Guo, Kaiwen [3, 4]
Shi, Haichao [1, 2]
Zhang, Xiao-Yu [1, 2]
Lei, Zhen [3, 4, 5]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 101408, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 101408, Peoples R China
[5] Chinese Acad Sci, Hong Kong Inst Sci & Innovat, CAIR, Hong Kong, Peoples R China
Keywords
Face recognition; Transformers; Feature extraction; Training; Task analysis; Costs; COVID-19; Masked face recognition; face recognition; transformer; RECOGNITION; ROBUST
DOI
10.1109/TIFS.2023.3322600
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The COVID-19 pandemic made wearing masks mandatory. Existing CNN-based face recognition (FR) systems suffer severe performance degradation because masks occlude vital facial regions. Recently, Vision Transformers have shown promising performance in various vision tasks, but at quadratic computation cost. Swin Transformer first proposed a successive window attention mechanism that allows cross-window connections and is more computationally efficient. Despite its potential, deploying Swin Transformer for masked face recognition encounters two challenges: 1) the attention range is insufficient to capture locally compatible face regions; and 2) masked face recognition can be defined as an occlusion-robust classification task with a known occlusion position, i.e., the position of the mask varies only slightly, a property that is overlooked but effective in improving the model's recognition accuracy. To address these challenges, we propose a Masked Face Transformer (MFT) with Masked Face-compatible Attention (MFA). The proposed MFA 1) introduces two additional window partition configurations, namely row shift and column shift, to enlarge the attention range in Swin without changing the computation cost, and 2) suppresses the interaction between masked and non-masked regions to retain their discrepancies. Additionally, because mask occlusion separates the masked and non-masked samples of the same identity, we propose a ClassFormer module that explores the relationship between them to enhance intra-class aggregation. Extensive experiments show that MFT outperforms state-of-the-art masked face recognition methods on both simulated and real masked face test datasets.
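The two MFA ideas named in the abstract (extra row-shift and column-shift window partitions, and suppressed interaction between masked and non-masked regions) can be illustrated with a toy sketch. This is not the authors' implementation: the cyclic shift, the shift amount of half a window, the 4 x 4 token grid, and the assumption that the mask covers every token at or below a fixed row (`MASK_START_ROW`) are all hypothetical choices made here for illustration only.

```python
# Toy sketch only; all names and constants below are illustrative assumptions.

def shift_grid(grid, row_shift=0, col_shift=0):
    """Cyclically shift a 2D token grid up by row_shift and left by col_shift."""
    h, w = len(grid), len(grid[0])
    return [[grid[(r + row_shift) % h][(c + col_shift) % w] for c in range(w)]
            for r in range(h)]

def window_partition(grid, win):
    """Split an H x W grid into non-overlapping win x win windows (each flattened)."""
    h, w = len(grid), len(grid[0])
    windows = []
    for r0 in range(0, h, win):
        for c0 in range(0, w, win):
            windows.append([grid[r][c]
                            for r in range(r0, r0 + win)
                            for c in range(c0, c0 + win)])
    return windows

def region_attention_mask(window, is_masked):
    """allowed[i][j] is True only when tokens i and j lie in the same region
    (both under the face mask, or both outside it)."""
    return [[is_masked(a) == is_masked(b) for b in window] for a in window]

H = W = 4            # token grid size (assumed)
WIN = 2              # window size (assumed)
MASK_START_ROW = 2   # hypothetical: rows >= 2 are covered by the face mask

# Each token is represented by its original (row, col) position.
grid = [[(r, c) for c in range(W)] for r in range(H)]

plain = window_partition(grid, WIN)
row_shifted = window_partition(shift_grid(grid, row_shift=WIN // 2), WIN)
col_shifted = window_partition(shift_grid(grid, col_shift=WIN // 2), WIN)

def is_masked(token):
    return token[0] >= MASK_START_ROW

# The first row-shifted window straddles the mask boundary, so attention
# between its upper (non-masked) and lower (masked) tokens is blocked.
allowed = region_attention_mask(row_shifted[0], is_masked)
```

In this sketch every partition yields the same number of windows of the same size, so the per-layer attention cost is unchanged while different partitions group different neighboring tokens together. In a real attention layer, the `False` entries of `allowed` would typically be turned into a large negative bias added to the attention logits before the softmax.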
Pages: 265-279
Number of pages: 15