Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition

被引:20
|
作者
Hu, Weipeng [1 ]
Hu, Haifeng [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510275, Peoples R China
基金
中国国家自然科学基金;
关键词
NIR-VIS face recognition; deep learning; adversarial training; disentangled spectrum variations; orthogonality constraint; REGRESSION;
D O I
10.1109/TMM.2019.2938685
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surveillance cameras often capture near infrared images since it provides a low-cost and effective solution to acquire high-quality images under low-light environments. However, visual versus near infrared (VIS-NIR) heterogeneous face recognition (HFR) is still a challenging issue in computer vision community due to the gap between sensing patterns of different spectrums as well as the lack of sufficient training samples. To solve the above problem, in this paper, we present an effective Disentangled Spectrum Variations Networks (DSVNs) for VIS-NIR HFR. Two key strategies are introduced to the DSVNs for disentangling spectrum variations between two domains: Spectrum-adversarial Discriminative Feature Learning (SaDFL) and Step-wise Spectrum Orthogonal Decomposition (SSOD). The SaDFL consists of Identity-Discriminative subnetwork (IDNet) and Auxiliary Spectrum Adversarial subnetwork (ASANet). On the one hand, the IDNet is composed of a generator $G_H$ and a discriminator $D_U$ for extracting identity-discriminative feature. On the other hand, the ASANet is built by a generator $G_H$ and a discriminator $D_M$ for eliminating modality-variant spectrum information under the guidance of the discriminator $D_M$. The identity-label and modality-label HFR datasets are used to train the DSVNs with triplet loss. Both IDNet and ASANet can jointly enhance the domain-invariant feature representations via an adversarial learning. Furthermore, to disentangle spectrum variations effectively as well as making identity information and modality information unrelated to each other, we present a new topology of connection block called Disentangled Spectrum Variations (DSV). An orthogonality constraint is imposed to DSV at the convolution level for channel-wise orthogonal decomposition between the modality-invariant identity information and modality-variant spectrum information. In particular, the SSOD is built by stacking multiple modularized mirco-block DSV, and thereby enjoys the benefits of disentangling spectrum variation step by step. Moreover, we investigate the similarity calculation method to further improve the HFR performance. To sum up, the designed DSVNs leads to a purification of identity information as well as an elimination of modality information. Extensive experiments are carried out on two challenging NIR-VIS HFR datasets CASIA NIR-VIS 2.0 and Oulu-CASIA NIR-VIS, demonstrating the superiority of the proposed method.
引用
收藏
页码:1234 / 1248
页数:15
相关论文
共 50 条
  • [31] LAMP-HQ: A Large-Scale Multi-pose High-Quality Database and Benchmark for NIR-VIS Face Recognition
    Yu, Aijing
    Wu, Haoxue
    Huang, Huaibo
    Lei, Zhen
    He, Ran
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (05) : 1467 - 1483
  • [32] NIR-VIS REFLECTIVITY SPECTRA OF SOME TRANSITION-METAL THIOPHOSPHATES
    GRASSO, V
    NERI, F
    SILIPIGNI, L
    PIACENTINI, M
    NUOVO CIMENTO DELLA SOCIETA ITALIANA DI FISICA D-CONDENSED MATTER ATOMIC MOLECULAR AND CHEMICAL PHYSICS FLUIDS PLASMAS BIOPHYSICS, 1991, 13 (05): : 633 - 645
  • [33] Fabrication and Characterization Study of Porous Silicon for NIR-VIS Photodetector Applications
    Ali, Rusul Hamoud Abd
    Jabbar, Mushtak A.
    Abd, Ahmed N.
    INTERNATIONAL JOURNAL OF NANOSCIENCE, 2024, 23 (01)
  • [34] The cation and the anion vacancies in cadmium diphosphide: A NIR-VIS, IR, and Raman study
    Shportko, K. V.
    VIBRATIONAL SPECTROSCOPY, 2017, 92 : 230 - 233
  • [35] Matching NIR Face to VIS Face Using Transduction
    Zhu, Jun-Yong
    Zheng, Wei-Shi
    Lai, Jian-Huang
    Li, Stan Z.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (03) : 501 - 514
  • [36] NIR-to-VIS Face Recognition via Embattling Relations and Coordinates of the Pairwise Features
    Cho, MyeongAh
    Chung, Tae-young
    Kim, Taeoh
    Lee, Sangyoun
    2019 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2019,
  • [37] TRANSDUCTIVE VIS-NIR FACE MATCHING
    Zhu, Jun-Yong
    Zheng, Wei-Shi
    Lai, Jianhuang
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1437 - 1440
  • [38] Disentangled Variational Representation for Heterogeneous Face Recognition
    Wu, Xiang
    Huang, Huaibo
    Patel, Vishal M.
    He, Ran
    Sun, Zhenan
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9005 - 9012
  • [39] Cross Spectral Disparity Estimation From VIS and NIR Paired Images Using Disentangled Representation and Reversible Neural Networks
    Han, Qihui
    Jung, Cheolkon
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5326 - 5336
  • [40] Parallel-Structure-based Transfer Learning for Deep NIR-to-VIS Face Recognition
    Wang, Yufei
    Li, Yali
    Wang, Shengjin
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 146 - 156