Learning a Robust Synthetic Modality with Dual-Level Alignment for Visible-Infrared Person Re-identification

Cited by: 0
Authors
Wang, Zichun [1]
Cheng, Xu [1]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Visible-infrared person re-identification; Synthetic modality; Hetero-modality fusion; Dual-level alignment;
DOI
10.1007/978-981-97-8620-6_20
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Visible-Infrared Person Re-identification (VI-ReID) is a challenging task that involves matching visible and infrared person images across multiple camera views. The large gap between the visible and infrared modalities is a significant bottleneck. Existing works typically employ dual-stream networks to extract modality-shared representations, yet struggle to close this gap, resulting in inferior performance. To overcome this issue, we propose robust synthetic modality learning with dual-level alignment (RSDL), a VI-ReID method that generates a robust synthetic modality as a bridge to guide cross-modality alignment. Specifically, a hetero-modality fusion (HMF) strategy generates the robust synthetic modality through multi-scale feature fusion with a structure rebuild module (SRM) and a cross-modality spatial alignment (CSA) module. The strategy incorporates rich semantic structural patterns from visible and infrared images to handle the modality variation. Additionally, we design a dual-level regulation loss that jointly explores the stable feature relationships among the three modalities at both the instance and distribution levels for cross-modality alignment. This facilitates the discovery of modality-consistent and identity-aware representations. Extensive experiments on three VI-ReID benchmarks demonstrate the effectiveness of the proposed method.
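The abstract names a dual-level regulation loss but does not give its formulas. The following is only an illustrative sketch of what alignment "at both the instance and distribution levels" among three modalities could look like, not the authors' loss. All function names, the squared-distance terms, the mean-matching statistic, and the `weight` parameter are assumptions made for this example.

```python
# Illustrative sketch only: the function names, distance choices, and the
# `weight` parameter are assumptions, not taken from the paper.

def mean_vector(rows):
    """Column-wise mean of a batch of feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def instance_level_loss(visible, infrared, synthetic):
    """Average, over the batch, of the squared distance from each synthetic
    feature to its paired visible and infrared features."""
    total = 0.0
    for v, r, s in zip(visible, infrared, synthetic):
        total += sq_dist(s, v) + sq_dist(s, r)
    return total / len(synthetic)


def distribution_level_loss(visible, infrared, synthetic):
    """Squared distance between batch means: a simple mean-matching
    statistic pulling the synthetic distribution toward both others."""
    mu_v = mean_vector(visible)
    mu_r = mean_vector(infrared)
    mu_s = mean_vector(synthetic)
    return sq_dist(mu_s, mu_v) + sq_dist(mu_s, mu_r)


def dual_level_loss(visible, infrared, synthetic, weight=1.0):
    """Sum of the instance-level term and the weighted distribution-level term."""
    return (instance_level_loss(visible, infrared, synthetic)
            + weight * distribution_level_loss(visible, infrared, synthetic))


# Toy features: each synthetic vector is the midpoint of its visible/infrared pair.
visible = [[1.0, 0.0], [0.0, 1.0]]
infrared = [[0.0, 0.0], [1.0, 1.0]]
synthetic = [[0.5, 0.0], [0.5, 1.0]]

print(dual_level_loss(visible, infrared, synthetic))  # 0.5
```

In this toy batch the three modality means coincide, so the distribution-level term is zero and only the per-sample gap contributes. This shows why the two levels can carry different information.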
Pages: 289-303
Page count: 15
Related papers
50 in total
  • [21] Dual-Stream Transformer With Distribution Alignment for Visible-Infrared Person Re-Identification
    Chai, Zehua
    Ling, Yongguo
    Luo, Zhiming
    Lin, Dazhen
    Jiang, Min
    Li, Shaozi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6764 - 6776
  • [22] Learning enhancing modality-invariant features for visible-infrared person re-identification
    Zhang, La
    Zhao, Xu
    Du, Haohua
    Sun, Jian
    Wang, Jinqiao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (01) : 55 - 73
  • [23] An efficient framework for visible-infrared cross modality person re-identification
    Basaran, Emrah
    Gokmen, Muhittin
    Kamasak, Mustafa E.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87
  • [24] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
    Jiang, Kongzhu
    Zhang, Tianzhu
    Liu, Xiang
    Qian, Bingqiao
    Zhang, Yongdong
    Wu, Feng
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
  • [25] Dual-Semantic Consistency Learning for Visible-Infrared Person Re-Identification
    Zhang, Yiyuan
    Kang, Yuhao
    Zhao, Sanyuan
    Shen, Jianbing
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1554 - 1565
  • [26] Learning dual attention enhancement feature for visible-infrared person re-identification
    Zhang, Guoqing
    Zhang, Yinyin
    Zhang, Hongwei
    Chen, Yuhao
    Zheng, Yuhui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99
  • [27] Bidirectional modality information interaction for Visible-Infrared Person Re-identification
    Yang, Xi
    Liu, Huanling
    Wang, Nannan
    Gao, Xinbo
    PATTERN RECOGNITION, 2025, 161
  • [28] Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Wang, Tao
    Feng, Songhe
    Li, Yidong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7683 - 7698
  • [29] On learning distribution alignment for video-based visible-infrared person re-identification
    Fang, Pengfei
    Hu, Yaojun
    Zhu, Shipeng
    Xue, Hui
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 237
  • [30] Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Wang, Tao
    Feng, Songhe
    Li, Yidong
    arXiv, 2023