Text-augmented Multi-Modality contrastive learning for unsupervised visible-infrared person re-identification

Times cited: 0
Authors
Sun, Rui [1 ,2 ,3 ]
Huang, Guoxi [1 ,2 ]
Wang, Xuebin [1 ,2 ]
Du, Yun [1 ,2 ]
Zhang, Xudong [1 ,2 ,3 ]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, 485 Danxia Rd, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Anhui Key Lab Ind Safety & Emergency Technol, Hefei 230009, Peoples R China
[3] Minist Educ Peoples Republ China, Key Lab Knowledge Engn Big Data, Hefei 230009, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
Unsupervised person re-identification; Text-augmented features; Multi-Modality; Contrastive learning;
DOI
10.1016/j.imavis.2024.105310
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visible-infrared person re-identification has significant implications for intelligent security. Unsupervised methods can reduce the gap between modalities without labels. However, most previous unsupervised methods train their models on image information alone, so the models cannot capture rich deep semantic information. In this paper, we leverage CLIP to extract deep text information. We propose a Text-Image Alignment (TIA) module to align image and text information and effectively bridge the gap between the visible and infrared modalities. We introduce a Local-Global Image Match (LGIM) module to find homogeneous information. Specifically, we employ the Hungarian algorithm and the Simulated Annealing (SA) algorithm to extract original information from image features while mitigating the interference of heterogeneous information. Additionally, we design a Changeable Cross-modality Alignment Loss (CCAL) that enables the model to learn modality-specific features during different training stages. Through this targeted learning, our method performs well and is robust. Extensive experiments demonstrate the effectiveness of our approach: our method achieves a rank-1 accuracy that exceeds state-of-the-art approaches by approximately 10% on the RegDB dataset.
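The abstract names the Hungarian algorithm but does not say which quantities it matches. As a minimal illustration only, and not the authors' implementation, the sketch below solves the same kind of one-to-one assignment problem between hypothetical visible and infrared cluster centroids, using a cost of 1 minus cosine similarity. It finds the optimum by brute force over permutations instead of the Hungarian algorithm, which is practical only for a handful of items. The names `cosine` and `match_centroids` and the toy vectors are assumptions made for this example.

```python
from itertools import permutations
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm


def match_centroids(visible, infrared):
    """Minimum-cost one-to-one matching between two equally sized centroid lists.

    Cost is 1 - cosine similarity. Brute force over all permutations stands in
    for the Hungarian algorithm here; both return an optimal assignment, but
    brute force scales factorially and suits only tiny inputs.
    """
    n = len(visible)
    cost = [[1.0 - cosine(v, r) for r in infrared] for v in visible]
    best = min(
        permutations(range(n)),
        key=lambda p: sum(cost[i][p[i]] for i in range(n)),
    )
    total = sum(cost[i][best[i]] for i in range(n))
    return list(enumerate(best)), total


# Toy (hypothetical) centroids: three visible and three infrared clusters.
visible = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
infrared = [[0.1, 0.9], [0.9, 0.8], [0.9, 0.1]]

pairs, total = match_centroids(visible, infrared)
print(pairs)  # (visible index, infrared index) pairs
```

For realistic cluster counts, a polynomial-time solver such as an actual Hungarian implementation would replace the permutation search.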
Pages: 9
Related papers
50 in total
  • [1] Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification
    Yang, Bin
    Ye, Mang
    Chen, Jun
    Wu, Zesen
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2843 - 2851
  • [2] Two-stage contrastive learning for unsupervised visible-infrared person re-identification
    Zou, Yuan
    Zhu, Pengxu
    Yang, Jianwei
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
  • [3] Image-text feature learning for unsupervised visible-infrared person re-identification
    Guo, Jifeng
    Pang, Zhiqi
    IMAGE AND VISION COMPUTING, 2025, 158
  • [4] Contrastive Learning with Information Compensation for Visible-Infrared Person Re-Identification
    Zhang, La
    Guo, Haiyun
    Zhao, Xu
    Sun, Jian
    Wang, Jinqiao
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1266 - 1271
  • [5] Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification
    Zhang, La
    Guo, Haiyun
    Zhu, Kuan
    Qiao, Honglin
    Huang, Gaopan
    Zhang, Sen
    Zhang, Huichen
    Sun, Jian
    Wang, Jinqiao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [6] Robust Duality Learning for Unsupervised Visible-Infrared Person Re-Identification
    Li, Yongxiang
    Sun, Yuan
    Qin, Yang
    Peng, Dezhong
    Peng, Xi
    Hu, Peng
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1937 - 1948
  • [8] Beyond a strong baseline: cross-modality contrastive learning for visible-infrared person re-identification
    Fang, Pengfei
    Zhang, Yukang
    Lan, Zhenzhong
    MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
  • [9] Modality-Shared Prototypes for Enhanced Unsupervised Visible-Infrared Person Re-Identification
    Chen, Xiaohan
    Wang, Suqing
    Zheng, Yujin
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 237 - 250
  • [10] Modality Unifying Network for Visible-Infrared Person Re-Identification
    Yu, Hao
    Cheng, Xu
    Peng, Wei
    Liu, Weihao
    Zhao, Guoying
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11151 - 11161