MSSA: Multispectral Semantic Alignment for Cross-Modality Infrared-RGB Person Reidentification

被引：1

作者：

Chen, Qingshan ^{[1
]}

Zhang, Moyan ^{[1
]}

Quan, Zhenzhen ^{[1
]}

Zhang, Yumeng ^{[1
]}

Mozerov, Mikhail G. ^{[2
]}

Zhai, Chao ^{[1
]}

Li, Hongjuan ^{[1
]}

Li, Yujun ^{[1
]}

机构：

[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao 266237, Peoples R China

[2] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona 08192, Spain

来源：

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS | 2024年

基金：

国家重点研发计划;

关键词：

Cross-modality; infrared-RGB; person reidentification; spectral semantic alignment (SSA);

D O I：

10.1109/TCSS.2024.3403691

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The widespread deployment of dual-camera systems has laid a solid foundation for practical applications of infrared (IR)-RGB cross-modality person reidentification (ReID). However, the inherent modality differences between RGB and IR images cause significant intra-class variances in the feature space for individuals of the same identity. Current methods typically employ various network architectures for the image style transfer or extracting modality-invariant features, yet they overlook the information extraction from the most fundamental spectral semantic features. Based on the existing approaches, we propose a multi-spectral semantic alignment (MSSA) architecture aimed at aligning fine-grained spectral semantic features across both intra-modality and inter-modality perspectives. Through modality center semantic alignment (MCSA) learning, we comprehensively mitigate differences in identity features of different modalities. Moreover, to attenuate the discriminative information unique to a single modality, we introduce the modality reliability intensification (MRI) loss to enhance the reliability of identity information. Finally, to tackle the challenge that inter-modality intra-class disparities surpass inter-modality inter-class differences, we leverage the dynamic discriminative center (DDC) loss to further bolster the discriminability of reliable information. Through an extensive experiments conducted on SYSU-MM01, RegDB, and LLCM datasets, we demonstrate the substantial advantages of the proposed MSSA over other state-of-the-art methods.

引用

页数：16

共 50 条

[41] Cyclic Cross-Modality Interaction for Hyperspectral and Multispectral Image Fusion
Chen, Shi
Zhang, Lefei
Zhang, Liangpei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 741 - 753
[42] Cross-modality complementary information fusion for multispectral pedestrian detection
Yan, Chaoqi
Zhang, Hong
Li, Xuliang
Yang, Yifan
Yuan, Ding
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 10361 - 10386
[43] Cross-Modality Pyramid Alignment for Visual Intention Understanding
Ye, Mang
Shi, Qinghongya
Su, Kehua
Du, Bo
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2190 - 2201
[44] Cross-modality interactive attention network for multispectral pedestrian detection
Zhang, Lu
Liu, Zhiyong
Zhang, Shifeng
Yang, Xu
Qiao, Hong
Huang, Kaizhu
Hussain, Amir
INFORMATION FUSION, 2019, 50 : 20 - 29
[45] Cross-modality complementary information fusion for multispectral pedestrian detection
Chaoqi Yan
Hong Zhang
Xuliang Li
Yifan Yang
Ding Yuan
Neural Computing and Applications, 2023, 35 : 10361 - 10386
[46] SENTENCE AND PICTURE MEMORY - CROSS-MODALITY SEMANTIC INTEGRATION
PEZDEK, K
MARSH, G
BULLETIN OF THE PSYCHONOMIC SOCIETY, 1975, 6 (NB4) : 435 - 435
[47] CROSS-MODALITY SEMANTIC INTEGRATION OF SENTENCE AND PICTURE MEMORY
PEZDEK, K
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN LEARNING AND MEMORY, 1977, 3 (05): : 515 - 524
[48] RGB-INFRARED PAIRED-IMAGES GENERATION BASED ON FEATURE DISENTANGLE AND CROSS-MODALITY RECONSTRUCTION
Li, Lingfei
Zhang, Shun
Ao, Guanshu
Chu, Zunheng
Mei, Shaohui
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6105 - 6108
[49] Visible-Infrared Person Re-Identification via Cross-Modality Interaction Transformer
Feng, Yujian
Yu, Jian
Chen, Feng
Ji, Yimu
Wu, Fei
Liu, Shangdon
Jing, Xiao-Yuan
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7647 - 7659
[50] Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID
Cheng, De
He, Lingfeng
Wang, Nannan
Zhang, Shizhou
Wang, Zhen
Gao, Xinbo
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1325 - 1333

← 1 2 3 4 5 →