Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition

被引:20
|
作者
Hu, Weipeng [1 ]
Hu, Haifeng [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510275, Peoples R China
基金
中国国家自然科学基金;
关键词
NIR-VIS face recognition; deep learning; adversarial training; disentangled spectrum variations; orthogonality constraint; REGRESSION;
D O I
10.1109/TMM.2019.2938685
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surveillance cameras often capture near infrared images since it provides a low-cost and effective solution to acquire high-quality images under low-light environments. However, visual versus near infrared (VIS-NIR) heterogeneous face recognition (HFR) is still a challenging issue in computer vision community due to the gap between sensing patterns of different spectrums as well as the lack of sufficient training samples. To solve the above problem, in this paper, we present an effective Disentangled Spectrum Variations Networks (DSVNs) for VIS-NIR HFR. Two key strategies are introduced to the DSVNs for disentangling spectrum variations between two domains: Spectrum-adversarial Discriminative Feature Learning (SaDFL) and Step-wise Spectrum Orthogonal Decomposition (SSOD). The SaDFL consists of Identity-Discriminative subnetwork (IDNet) and Auxiliary Spectrum Adversarial subnetwork (ASANet). On the one hand, the IDNet is composed of a generator $G_H$ and a discriminator $D_U$ for extracting identity-discriminative feature. On the other hand, the ASANet is built by a generator $G_H$ and a discriminator $D_M$ for eliminating modality-variant spectrum information under the guidance of the discriminator $D_M$. The identity-label and modality-label HFR datasets are used to train the DSVNs with triplet loss. Both IDNet and ASANet can jointly enhance the domain-invariant feature representations via an adversarial learning. Furthermore, to disentangle spectrum variations effectively as well as making identity information and modality information unrelated to each other, we present a new topology of connection block called Disentangled Spectrum Variations (DSV). An orthogonality constraint is imposed to DSV at the convolution level for channel-wise orthogonal decomposition between the modality-invariant identity information and modality-variant spectrum information. In particular, the SSOD is built by stacking multiple modularized mirco-block DSV, and thereby enjoys the benefits of disentangling spectrum variation step by step. Moreover, we investigate the similarity calculation method to further improve the HFR performance. To sum up, the designed DSVNs leads to a purification of identity information as well as an elimination of modality information. Extensive experiments are carried out on two challenging NIR-VIS HFR datasets CASIA NIR-VIS 2.0 and Oulu-CASIA NIR-VIS, demonstrating the superiority of the proposed method.
引用
收藏
页码:1234 / 1248
页数:15
相关论文
共 50 条
  • [1] Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition
    Hu, Weipeng
    Hu, Haifeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 145 - 160
  • [2] Towards NIR-VIS Masked Face Recognition
    Du, Hang
    Shi, Hailin
    Liu, Yinglu
    Zeng, Dan
    Mei, Tao
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 768 - 772
  • [3] Dual Face Alignment Learning Network for NIR-VIS Face Recognition
    Hu, Weipeng
    Yan, Wenjun
    Hu, Haifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2411 - 2424
  • [4] Syncretic Space Learning Network for NIR-VIS Face Recognition
    Yang, Yiming
    Hu, Weipeng
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (01)
  • [5] Learning Invariant Deep Representation for NIR-VIS Face Recognition
    He, Ran
    Wu, Xiang
    Sun, Zhenan
    Tan, Tieniu
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2000 - 2006
  • [6] Transferring Deep Representation for NIR-VIS Heterogeneous Face Recognition
    Liu, Xiaoxiang
    Song, Lingxiao
    Wu, Xiang
    Tan, Tieniu
    2016 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2016,
  • [7] Joint Feature Distribution Alignment Learning for NIR-VIS and VIS-VIS Face Recognition
    Miyamoto, Takaya
    Hashimoto, Hiroshi
    Hayasaka, Akihiro
    Ebihara, Akinori F.
    Imaoka, Hitoshi
    2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [8] Adversarial Cross-Spectral Face Completion for NIR-VIS Face Recognition
    He, Ran
    Cao, Jie
    Song, Lingxiao
    Sun, Zhenan
    Tan, Tieniu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1025 - 1037
  • [9] Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition
    He, Ran
    Wu, Xiang
    Sun, Zhenan
    Tan, Tieniu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (07) : 1761 - 1773
  • [10] Partial NIR-VIS Heterogeneous Face Recognition With Automatic Saliency Search
    Luo, Mandi
    Ma, Xin
    Li, Zhihang
    Cao, Jie
    He, Ran
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 5003 - 5017