MSSA: Multispectral Semantic Alignment for Cross-Modality Infrared-RGB Person Reidentification

被引:1
|
作者
Chen, Qingshan [1 ]
Zhang, Moyan [1 ]
Quan, Zhenzhen [1 ]
Zhang, Yumeng [1 ]
Mozerov, Mikhail G. [2 ]
Zhai, Chao [1 ]
Li, Hongjuan [1 ]
Li, Yujun [1 ]
机构
[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao 266237, Peoples R China
[2] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona 08192, Spain
基金
国家重点研发计划;
关键词
Cross-modality; infrared-RGB; person reidentification; spectral semantic alignment (SSA);
D O I
10.1109/TCSS.2024.3403691
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The widespread deployment of dual-camera systems has laid a solid foundation for practical applications of infrared (IR)-RGB cross-modality person reidentification (ReID). However, the inherent modality differences between RGB and IR images cause significant intra-class variances in the feature space for individuals of the same identity. Current methods typically employ various network architectures for the image style transfer or extracting modality-invariant features, yet they overlook the information extraction from the most fundamental spectral semantic features. Based on the existing approaches, we propose a multi-spectral semantic alignment (MSSA) architecture aimed at aligning fine-grained spectral semantic features across both intra-modality and inter-modality perspectives. Through modality center semantic alignment (MCSA) learning, we comprehensively mitigate differences in identity features of different modalities. Moreover, to attenuate the discriminative information unique to a single modality, we introduce the modality reliability intensification (MRI) loss to enhance the reliability of identity information. Finally, to tackle the challenge that inter-modality intra-class disparities surpass inter-modality inter-class differences, we leverage the dynamic discriminative center (DDC) loss to further bolster the discriminability of reliable information. Through an extensive experiments conducted on SYSU-MM01, RegDB, and LLCM datasets, we demonstrate the substantial advantages of the proposed MSSA over other state-of-the-art methods.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] RGB-IR Person Re-identification by Cross-Modality Similarity Preservation
    Wu, Ancong
    Zheng, Wei-Shi
    Gong, Shaogang
    Lai, Jianhuang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (06) : 1765 - 1785
  • [22] RGB-IR Person Re-identification by Cross-Modality Similarity Preservation
    Ancong Wu
    Wei-Shi Zheng
    Shaogang Gong
    Jianhuang Lai
    International Journal of Computer Vision, 2020, 128 : 1765 - 1785
  • [23] RGB-D Domain adaptive semantic segmentation with cross-modality feature recalibration
    Fan, Qizhe
    Shen, Xiaoqin
    Ying, Shihui
    Wang, Juan
    Du, Shaoyi
    INFORMATION FUSION, 2025, 120
  • [24] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
    Huang, Lian
    Peng, Zongju
    Chen, Fen
    Dai, Shaosheng
    He, Ziqiang
    Liu, Kesheng
    NEURAL NETWORKS, 2024, 173
  • [25] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
    Huang, Lian
    Peng, Zongju
    Chen, Fen
    Dai, Shaosheng
    He, Ziqiang
    Liu, Kesheng
    Neural Networks, 2024, 173
  • [26] Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Li, Yidong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8432 - 8444
  • [27] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
    Jiang, Kongzhu
    Zhang, Tianzhu
    Liu, Xiang
    Qian, Bingqiao
    Zhang, Yongdong
    Wu, Feng
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
  • [28] Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentification
    Song, Zhichao
    Ni, Bingbing
    Yan, Yichao
    Ren, Zhe
    Xu, Yi
    Yang, Xiaokang
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 645 - 653
  • [29] Dual-granularity feature alignment for cross-modality person re-identification
    Yin, Junhui
    Ma, Zhanyu
    Xie, Jiyang
    Nie, Shibo
    Liang, Kongming
    Guo, Jun
    NEUROCOMPUTING, 2022, 511 : 78 - 90
  • [30] Dual-alignment Feature Embedding for Cross-modality Person Re-identification
    Hao, Yi
    Wang, Nannan
    Gao, Xinbo
    Li, Jie
    Wang, Xiaoyu
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 57 - 65