Learning a Robust Synthetic Modality with Dual-Level Alignment for Visible-Infrared Person Re-identification

Times cited: 0
Authors
Wang, Zichun [1]
Cheng, Xu [1]
Affiliations
[1] Nanjing University of Information Science and Technology, School of Computer Science, Nanjing, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Visible-infrared person re-identification; Synthetic modality; Hetero-modality fusion; Dual-level alignment;
DOI
10.1007/978-981-97-8620-6_20
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visible-Infrared Person Re-identification (VI-ReID) is a challenging task that involves matching visible and infrared person images across multiple camera views. The large gap between the visible and infrared modalities has become a significant bottleneck. Existing works typically employ dual-stream networks to extract a shared modality representation, yet they struggle to close this gap, resulting in inferior performance. To overcome this issue, we propose robust synthetic modality learning with dual-level alignment (RSDL), a VI-ReID method that generates a robust synthetic modality as a bridge to guide cross-modality alignment. Specifically, a hetero-modality fusion (HMF) strategy is introduced to generate the robust synthetic modality by using multi-scale feature fusion with a structure rebuild module (SRM) and a cross-modality spatial alignment (CSA) module. The strategy incorporates rich semantic structural patterns from visible and infrared images to handle the modality variation. Additionally, we design a dual-level regulation loss to jointly explore the stable feature relationships among the three modalities at both the instance and distribution levels for cross-modality alignment. This facilitates the discovery of modality-consistent and identity-aware representations. Extensive experiments on three VI-ReID benchmarks demonstrate the effectiveness of the proposed method.
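The abstract does not give the form of the dual-level regulation loss, so the following is only an illustrative sketch of what aligning three modalities at both the instance and distribution levels could look like. Everything in it is an assumption made for illustration: the function names, the squared Euclidean distance, the use of batch means as the distribution statistic, and the `weight` parameter are hypothetical and are not taken from the paper.

```python
from statistics import fmean


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_vector(feats):
    """Per-dimension mean of a batch of feature vectors."""
    return [fmean(col) for col in zip(*feats)]


def instance_level_loss(vis, ir, syn):
    """Assumed instance-level term: pull each synthetic feature toward
    the visible and infrared features paired with it by position."""
    per_sample = [sq_dist(s, v) + sq_dist(s, r) for v, r, s in zip(vis, ir, syn)]
    return fmean(per_sample)


def distribution_level_loss(vis, ir, syn):
    """Assumed distribution-level term: match the batch mean of the
    synthetic modality to the batch means of the two real modalities."""
    mu_v, mu_r, mu_s = mean_vector(vis), mean_vector(ir), mean_vector(syn)
    return sq_dist(mu_s, mu_v) + sq_dist(mu_s, mu_r)


def dual_level_loss(vis, ir, syn, weight=1.0):
    """Hypothetical combination of the two terms; `weight` is illustrative."""
    return instance_level_loss(vis, ir, syn) + weight * distribution_level_loss(vis, ir, syn)


if __name__ == "__main__":
    # Toy batch: two samples, two feature dimensions per modality.
    vis = [[1.0, 0.0], [0.0, 1.0]]
    ir = [[0.0, 1.0], [1.0, 0.0]]
    syn = [[0.5, 0.5], [0.5, 0.5]]
    print(instance_level_loss(vis, ir, syn))
    print(distribution_level_loss(vis, ir, syn))
```

In the toy batch the three modalities share the same batch mean, so the distribution-level term vanishes while the instance-level term does not. This suggests one reason a method might supervise both levels rather than only one.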
Pages: 289-303
Page count: 15
Related papers
50 in total
  • [31] A guidance and alignment transformer model for visible-infrared person re-identification
    Huang, Linyu
    Xue, Zijie
    Ning, Qian
    Guo, Yong
    Li, Yongsheng
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [32] Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Li, Yidong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8432 - 8444
  • [33] Occluded Visible-Infrared Person Re-Identification
    Feng, Yujian
    Ji, Yimu
    Wu, Fei
    Gao, Guangwei
    Gao, Yang
    Liu, Tianliang
    Liu, Shangdong
    Jing, Xiao-Yuan
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1401 - 1413
  • [34] FMCNet+: Feature-Level Modality Compensation for Visible-Infrared Person Re-Identification
    Xi, Ruida
    Huang, Nianchang
    Lai, Changzhou
    Zhang, Qiang
    Han, Jungong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [35] Tri-Level Modality-Information Disentanglement for Visible-Infrared Person Re-Identification
    Lu, Zefeng
    Lin, Ronghao
    Hu, Haifeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2700 - 2714
  • [36] Modality Blur and Batch Alignment Learning for Twin Noisy Labels-based Visible-infrared Person Re-identification
    Wu, Song
    Shan, Shihao
    Xiao, Guoqiang
    Lew, Michael S.
    Gao, Xinbo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [37] Learning Progressive Modality-Shared Transformers for Effective Visible-Infrared Person Re-identification
    Lu, Hu
    Zou, Xuezhang
    Zhang, Pingping
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1835 - 1843
  • [38] Adversarial Decoupling and Modality-Invariant Representation Learning for Visible-Infrared Person Re-Identification
    Hu, Weipeng
    Liu, Bohong
    Zeng, Haitang
    Hou, Yanke
    Hu, Haifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5095 - 5109
  • [39] Dual-attentive cascade clustering learning for visible-infrared person re-identification
    Wang, Xianju
    Chen, Cuiqun
    Zhu, Yong
    Chen, Shuguang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19729 - 19746
  • [40] Visible-Infrared Person Re-identification via Modality Augmentation and Center Constraints
    Chen, Qiang
    Xiao, Guoqiang
    Wu, Jiahao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 221 - 232