Cross-modality person re-identification based on intermediate modal generation

被引:1
|
作者
Lu, Jian [1 ]
Zhang, Shasha [1 ]
Chen, Mengdie [1 ]
Chen, Xiaogai [1 ]
Zhang, Kaibing [1 ]
机构
[1] Xian Polytech Univ, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Cross-modality; Intermediate modality generation;
D O I
10.1016/j.optlaseng.2024.108117
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In the context of cross -modal person re -identification, researchers often employ methods that utilize visible modality information to generate both an 'X' modality and a grayscale modality, enhancing the accuracy of person re -identification models. A lightweight network causes the 'X' modality through self -supervised learning of labels from visible images. In contrast, the grayscale modality is obtained through simple linear accumulation of the three RGB color channels from visual images. It can be observed that both the 'X' modality and grayscale modality are derived from visible images, which fails to establish a connection between the visible and infrared modalities. Therefore, this paper proposes an intermediate modality generation module to produce intermediate modality representations dynamically. By combining information from the visible, infrared, and intermediate modalities, the model is encouraged to capture modality -invariant features with cross -modal consistency. This enables person of the same identity to exhibit similar feature representations across different modalities, thereby mitigating the impact of distribution differences between visible and infrared modalities. Additionally, to facilitate the learning of appropriate intermediate modality representations, a distribution migration strategy is introduced. It guides the intermediate modality to maintain the correct distance from the visible and infrared modalities by optimizing the weights of the loss functions, preventing inadequate feature learning caused by an excessive focus on specific modality information. Furthermore, a mixed augmentation approach is proposed to alleviate disparities among multiple modalities further. By randomly cropping and combining regions of visible (infrared) modality images with infrared (visible) modality images, the generalization ability of the model in heterogeneous modalities is enhanced. Extensive comparative experiments are conducted on the SYSU-MM01 and RegDB datasets, yielding mAP values of 57.2% and 85.82%, respectively. The superior mAP performance on the RegDB dataset compared to most existing methods validates the effectiveness of the proposed approach.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Distance based Training for Cross-Modality Person Re-Identification
    Tekeli, Nihat
    Can, Ahmet Burak
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4540 - 4549
  • [2] Modality interactive attention for cross-modality person re-identification
    Zou, Zilin
    Chen, Ying
    [J]. IMAGE AND VISION COMPUTING, 2024, 148
  • [3] A Survey on Cross-Modality Heterogeneous Person Re-identification
    Sun R.
    Zhao Z.
    Yang Z.
    Gao J.
    [J]. Sun, Rui (sunrui@hfut.edu.cn), 1600, Science Press (33): : 1066 - 1082
  • [4] Cross-Modality Person Re-Identification Method with Joint-Modality Generation and Feature Enhancement
    Bi, Yihan
    Wang, Rong
    Zhou, Qianli
    Zeng, Zhaolong
    Lin, Ronghui
    Wang, Mingjie
    [J]. ENTROPY, 2024, 26 (08)
  • [5] Self-attention Cross-modality Fusion Network for Cross-modality Person Re-identification
    Du P.
    Song Y.-H.
    Zhang X.-Y.
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (06): : 1457 - 1468
  • [6] Hierarchical Feature Fusion for Cross-Modality Person Re-identification
    Fu, Wen
    Lim, Monghao
    [J]. International Journal of Pattern Recognition and Artificial Intelligence, 2024, 38 (16)
  • [7] Dynamic feature weakening for cross-modality person re-identification*
    Lu, Jian
    Chen, Mengdie
    Wang, Hangying
    Pang, Feifei
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 109
  • [8] RGB-Infrared Cross-Modality Person Re-Identification
    Wu, Ancong
    Zheng, Wei-Shi
    Yu, Hong-Xing
    Gong, Shaogang
    Lai, Jianhuang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5390 - 5399
  • [9] Channel decoupling network for cross-modality person re-identification
    Chen, Jingying
    Chen, Chang
    Tan, Lei
    Peng, Shixin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 14091 - 14105
  • [10] Cross-Modality Person Re-Identification with Generative Adversarial Training
    Dai, Pingyang
    Ji, Rongrong
    Wang, Haibin
    Wu, Qiong
    Huang, Yuyu
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 677 - 683