Masked Face Transformer

被引：0

作者：

Zhao, Weisong ^{[1
,2
]}

Zhu, Xiangyu ^{[3
,4
]}

Guo, Kaiwen ^{[3
,4
]}

Shi, Haichao ^{[1
,2
]}

Zhang, Xiao-Yu ^{[1
,2
]}

Lei, Zhen ^{[3
,4
,5
]}

机构：

[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 101408, Peoples R China

[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China

[4] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 101408, Peoples R China

[5] Chinese Acad Sci, Hong Kong Inst Sci & Innovat, CAIR, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2024年 / 19卷

关键词：

Face recognition; Transformers; Feature extraction; Training; Task analysis; Costs; COVID-19; Masked face recognition; face recognition; transformer; RECOGNITION; ROBUST;

D O I：

10.1109/TIFS.2023.3322600

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The COVID-19 pandemic makes wearing masks mandatory. Existing CNN-based face recognition (FR) systems suffer from severe performance degradation as masks occlude the vital facial regions. Recently, Vision Transformers have shown promising performance in various vision tasks with quadratic computation costs. Swin Transformer first proposes a successive window attention mechanism allowing the cross-window connection and more computational efficiency. Despite its potential, the deployment of Swin Transformer in masked face recognition encounters two challenges: 1) the attention range is insufficient to capture locally compatible face regions. 2) Masked face recognition can be defined as an occlusion-robust classification task with a known occlusion position, i.e., the position of the mask is minor-varying, which is overlooked but efficient in improving the model's recognition accuracy. To alleviate the above problem, we propose a Masked Face Transformer (MFT) with Masked Face-compatible Attention (MFA). The proposed MFA 1) introduces two additional window partition configurations, e.g., row shift and column shift, to enlarge the attention range in Swin with invariant computation costs, and 2) suppresses the interaction between the masked and non-masked regions to retain their discrepancies. Additionally, as mask occlusion leads to a separation between the masked and non-masked samples of the same identity, we propose to explore the relationship between them by a ClassFormer module to enhance intra-class aggregation. Extensive experiments show that MFT outperforms state-of-the-art masked face recognition methods in both simulated and real masked face testing datasets.

引用

页码：265 / 279

页数：15

共 50 条

[1] Learning 3D Face Representation with Vision Transformer for Masked Face Recognition
Wang, Yuan
Yang, Zhen
Zhang, Zhiqiang
Zang, Huaijuan
Zhu, Qiang
Zhan, Shu
[J]. 2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 505 - 511
[2] Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing
Yu, Zitong
Cai, Rizhao
Cui, Yawen
Liu, Xin
Hu, Yongjian
Kot, Alex C.
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
[3] MASKED FACE
HONIGMANN, JJ
[J]. ETHOS, 1977, 5 (03) : 263 - 280
[4] Masked Spiking Transformer
Wang, Ziqing
Fang, Yuetong
Cao, Jiahang
Zhang, Qiang
Wang, Zhongrui
Xu, Renjing
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1761 - 1771
[5] Swin-CasUNet: Cascaded U-Net with Swin Transformer for Masked Face Restoration
Zeng, Chengbin
Liu, Yi
Song, Chunli
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 386 - 392
[6] Cultures of the (masked) face
Marino, Gabriele
[J]. SIGN SYSTEMS STUDIES, 2021, 49 (3-4) : 318 - 337
[7] The Masked Face of Constipation
Rodriguez-Malave, Mary
Gonzalez-Bravo, Diego
Rodriguez-Gonzalez, Dante
Rivera-Torres, Juan
[J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2020, 115 : S1783 - S1783
[8] EFFICIENT FACE ALIGNMENT NETWORK FOR MASKED FACE
Sha, Yuyang
Zhang, Jie
Liu, Xiao
Wu, Zhongqin
Shan, Shiguang
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
[9] Masked Transformer for Image Anomaly Localization
De Nardin, Axel
Mishra, Pankaj
Foresti, Gian Luca
Piciarelli, Claudio
[J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (07)
[10] A Benchmark on Masked Face Recognition
Vidal, Pedro
Granada, Roger Leitzke
Fuhr, Gustavo
Testoni, Vanessa
Menotti, David
[J]. 2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 204 - 209

← 1 2 3 4 5 →