Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition

被引：20

作者：

Hu, Weipeng ^{[1
]}

Hu, Haifeng ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510275, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2020年 / 22卷 / 05期

基金：

中国国家自然科学基金;

关键词：

NIR-VIS face recognition; deep learning; adversarial training; disentangled spectrum variations; orthogonality constraint; REGRESSION;

D O I：

10.1109/TMM.2019.2938685

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Surveillance cameras often capture near infrared images since it provides a low-cost and effective solution to acquire high-quality images under low-light environments. However, visual versus near infrared (VIS-NIR) heterogeneous face recognition (HFR) is still a challenging issue in computer vision community due to the gap between sensing patterns of different spectrums as well as the lack of sufficient training samples. To solve the above problem, in this paper, we present an effective Disentangled Spectrum Variations Networks (DSVNs) for VIS-NIR HFR. Two key strategies are introduced to the DSVNs for disentangling spectrum variations between two domains: Spectrum-adversarial Discriminative Feature Learning (SaDFL) and Step-wise Spectrum Orthogonal Decomposition (SSOD). The SaDFL consists of Identity-Discriminative subnetwork (IDNet) and Auxiliary Spectrum Adversarial subnetwork (ASANet). On the one hand, the IDNet is composed of a generator $G_H$ and a discriminator $D_U$ for extracting identity-discriminative feature. On the other hand, the ASANet is built by a generator $G_H$ and a discriminator $D_M$ for eliminating modality-variant spectrum information under the guidance of the discriminator $D_M$. The identity-label and modality-label HFR datasets are used to train the DSVNs with triplet loss. Both IDNet and ASANet can jointly enhance the domain-invariant feature representations via an adversarial learning. Furthermore, to disentangle spectrum variations effectively as well as making identity information and modality information unrelated to each other, we present a new topology of connection block called Disentangled Spectrum Variations (DSV). An orthogonality constraint is imposed to DSV at the convolution level for channel-wise orthogonal decomposition between the modality-invariant identity information and modality-variant spectrum information. In particular, the SSOD is built by stacking multiple modularized mirco-block DSV, and thereby enjoys the benefits of disentangling spectrum variation step by step. Moreover, we investigate the similarity calculation method to further improve the HFR performance. To sum up, the designed DSVNs leads to a purification of identity information as well as an elimination of modality information. Extensive experiments are carried out on two challenging NIR-VIS HFR datasets CASIA NIR-VIS 2.0 and Oulu-CASIA NIR-VIS, demonstrating the superiority of the proposed method.

引用

页码：1234 / 1248

页数：15

共 50 条

[1] Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition
Hu, Weipeng
Hu, Haifeng
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 145 - 160
[2] Towards NIR-VIS Masked Face Recognition
Du, Hang
Shi, Hailin
Liu, Yinglu
Zeng, Dan
Mei, Tao
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 768 - 772
[3] Dual Face Alignment Learning Network for NIR-VIS Face Recognition
Hu, Weipeng
Yan, Wenjun
Hu, Haifeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2411 - 2424
[4] Syncretic Space Learning Network for NIR-VIS Face Recognition
Yang, Yiming
Hu, Weipeng
Hu, Haifeng
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (01)
[5] Learning Invariant Deep Representation for NIR-VIS Face Recognition
He, Ran
Wu, Xiang
Sun, Zhenan
Tan, Tieniu
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2000 - 2006
[6] Transferring Deep Representation for NIR-VIS Heterogeneous Face Recognition
Liu, Xiaoxiang
Song, Lingxiao
Wu, Xiang
Tan, Tieniu
2016 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2016,
[7] Joint Feature Distribution Alignment Learning for NIR-VIS and VIS-VIS Face Recognition
Miyamoto, Takaya
Hashimoto, Hiroshi
Hayasaka, Akihiro
Ebihara, Akinori F.
Imaoka, Hitoshi
2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
[8] Adversarial Cross-Spectral Face Completion for NIR-VIS Face Recognition
He, Ran
Cao, Jie
Song, Lingxiao
Sun, Zhenan
Tan, Tieniu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1025 - 1037
[9] Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition
He, Ran
Wu, Xiang
Sun, Zhenan
Tan, Tieniu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (07) : 1761 - 1773
[10] Partial NIR-VIS Heterogeneous Face Recognition With Automatic Saliency Search
Luo, Mandi
Ma, Xin
Li, Zhihang
Cao, Jie
He, Ran
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 5003 - 5017

← 1 2 3 4 5 →