Cross-modality person re-identification based on intermediate modal generation

被引：1

作者：

Lu, Jian ^{[1
]}

Zhang, Shasha ^{[1
]}

Chen, Mengdie ^{[1
]}

Chen, Xiaogai ^{[1
]}

Zhang, Kaibing ^{[1
]}

机构：

[1] Xian Polytech Univ, Xian, Peoples R China

来源：

OPTICS AND LASERS IN ENGINEERING | 2024年 / 177卷

基金：

中国国家自然科学基金;

关键词：

Person re-identification; Cross-modality; Intermediate modality generation;

D O I：

10.1016/j.optlaseng.2024.108117

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

In the context of cross -modal person re -identification, researchers often employ methods that utilize visible modality information to generate both an 'X' modality and a grayscale modality, enhancing the accuracy of person re -identification models. A lightweight network causes the 'X' modality through self -supervised learning of labels from visible images. In contrast, the grayscale modality is obtained through simple linear accumulation of the three RGB color channels from visual images. It can be observed that both the 'X' modality and grayscale modality are derived from visible images, which fails to establish a connection between the visible and infrared modalities. Therefore, this paper proposes an intermediate modality generation module to produce intermediate modality representations dynamically. By combining information from the visible, infrared, and intermediate modalities, the model is encouraged to capture modality -invariant features with cross -modal consistency. This enables person of the same identity to exhibit similar feature representations across different modalities, thereby mitigating the impact of distribution differences between visible and infrared modalities. Additionally, to facilitate the learning of appropriate intermediate modality representations, a distribution migration strategy is introduced. It guides the intermediate modality to maintain the correct distance from the visible and infrared modalities by optimizing the weights of the loss functions, preventing inadequate feature learning caused by an excessive focus on specific modality information. Furthermore, a mixed augmentation approach is proposed to alleviate disparities among multiple modalities further. By randomly cropping and combining regions of visible (infrared) modality images with infrared (visible) modality images, the generalization ability of the model in heterogeneous modalities is enhanced. Extensive comparative experiments are conducted on the SYSU-MM01 and RegDB datasets, yielding mAP values of 57.2% and 85.82%, respectively. The superior mAP performance on the RegDB dataset compared to most existing methods validates the effectiveness of the proposed approach.

引用

页数：10

共 50 条

[1] Distance based Training for Cross-Modality Person Re-Identification
Tekeli, Nihat
Can, Ahmet Burak
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4540 - 4549
[2] Modality interactive attention for cross-modality person re-identification
Zou, Zilin
Chen, Ying
[J]. IMAGE AND VISION COMPUTING, 2024, 148
[3] A Survey on Cross-Modality Heterogeneous Person Re-identification
Sun R.
Zhao Z.
Yang Z.
Gao J.
[J]. Sun, Rui (sunrui@hfut.edu.cn), 1600, Science Press (33): : 1066 - 1082
[4] Cross-Modality Person Re-Identification Method with Joint-Modality Generation and Feature Enhancement
Bi, Yihan
Wang, Rong
Zhou, Qianli
Zeng, Zhaolong
Lin, Ronghui
Wang, Mingjie
[J]. ENTROPY, 2024, 26 (08)
[5] Self-attention Cross-modality Fusion Network for Cross-modality Person Re-identification
Du P.
Song Y.-H.
Zhang X.-Y.
[J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (06): : 1457 - 1468
[6] Hierarchical Feature Fusion for Cross-Modality Person Re-identification
Fu, Wen
Lim, Monghao
[J]. International Journal of Pattern Recognition and Artificial Intelligence, 2024, 38 (16)
[7] Dynamic feature weakening for cross-modality person re-identification*
Lu, Jian
Chen, Mengdie
Wang, Hangying
Pang, Feifei
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 109
[8] RGB-Infrared Cross-Modality Person Re-Identification
Wu, Ancong
Zheng, Wei-Shi
Yu, Hong-Xing
Gong, Shaogang
Lai, Jianhuang
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5390 - 5399
[9] Channel decoupling network for cross-modality person re-identification
Chen, Jingying
Chen, Chang
Tan, Lei
Peng, Shixin
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 14091 - 14105
[10] Cross-Modality Person Re-Identification with Generative Adversarial Training
Dai, Pingyang
Ji, Rongrong
Wang, Haibin
Wu, Qiong
Huang, Yuyu
[J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 677 - 683

← 1 2 3 4 5 →