MSSA: Multispectral Semantic Alignment for Cross-Modality Infrared-RGB Person Reidentification

Cited by: 1
Authors
Chen, Qingshan [1]
Zhang, Moyan [1]
Quan, Zhenzhen [1]
Zhang, Yumeng [1]
Mozerov, Mikhail G. [2]
Zhai, Chao [1]
Li, Hongjuan [1]
Li, Yujun [1]
Affiliations
[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao 266237, Peoples R China
[2] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona 08192, Spain
Funding
National Key Research and Development Program of China;
Keywords
Cross-modality; infrared-RGB; person reidentification; spectral semantic alignment (SSA);
DOI
10.1109/TCSS.2024.3403691
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The widespread deployment of dual-camera systems has laid a solid foundation for practical applications of infrared (IR)-RGB cross-modality person reidentification (ReID). However, the inherent modality differences between RGB and IR images cause significant intra-class variance in the feature space for individuals of the same identity. Current methods typically employ various network architectures for image style transfer or for extracting modality-invariant features, yet they overlook information extraction from the most fundamental spectral semantic features. Building on existing approaches, we propose a multispectral semantic alignment (MSSA) architecture that aligns fine-grained spectral semantic features from both intra-modality and inter-modality perspectives. Through modality center semantic alignment (MCSA) learning, we comprehensively mitigate differences between the identity features of different modalities. Moreover, to attenuate the discriminative information unique to a single modality, we introduce the modality reliability intensification (MRI) loss to enhance the reliability of identity information. Finally, to tackle the challenge that inter-modality intra-class disparities surpass inter-modality inter-class differences, we leverage the dynamic discriminative center (DDC) loss to further bolster the discriminability of reliable information. Through extensive experiments conducted on the SYSU-MM01, RegDB, and LLCM datasets, we demonstrate the substantial advantages of the proposed MSSA over other state-of-the-art methods.
Pages: 16