Text-augmented Multi-Modality contrastive learning for unsupervised visible-infrared person re-identification

被引:0
|
作者
Sun, Rui [1 ,2 ,3 ]
Huang, Guoxi [1 ,2 ]
Wang, Xuebin [1 ,2 ]
Du, Yun [1 ,2 ]
Zhang, Xudong [1 ,2 ,3 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, 485 Danxia Rd, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Anhui Key Lab Ind Safety & Emergency Technol, Hefei 230009, Peoples R China
[3] Minist Educ Peoples Republ China, Key Lab Knowledge Engn Big Data, Hefei 230009, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Unsupervised person re-identification; Text-augmented features; Multi-Modality; Contrastive learning;
D O I
10.1016/j.imavis.2024.105310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visible-infrared person re-identification holds significant implications for intelligent security. Unsupervised methods can reduce the gap of different modalities without labels. Most previous unsupervised methods only train their models with image information, so that the model cannot obtain powerful deep semantic information. In this paper, we leverage CLIP to extract deep text information. We propose a Text-Image Alignment (TIA) module to align the image and text information and effectively bridge the gap between visible and infrared modality. We produce a Local-Global Image Match (LGIM) module to find homogeneous information. Specifically, we employ the Hungarian algorithm and Simulated Annealing (SA) algorithm to attain original information from image features while mitigating the interference of heterogeneous information. Additionally, we design a Changeable Cross-modality Alignment Loss (CCAL) to enable the model to learn modality-specific features during different training stages. Our method performs well and attains powerful robustness by targeted learning. Extensive experiments demonstrate the effectiveness of our approach, our method achieves a rank-1 accuracy that exceeds state-of-the-art approaches by approximately 10% on the RegDB.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification
    Li, Zhiyong
    Liu, Haojie
    Peng, Xiantao
    Jiang, Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 3934 - 3947
  • [32] Multi-dimensional feature learning for visible-infrared person re-identification
    Yang, Zhenzhen
    Wu, Xinyi
    Yang, Yongpeng
    BIG DATA RESEARCH, 2025, 40
  • [33] Multi-Stage Auxiliary Learning for Visible-Infrared Person Re-Identification
    Zhang, Huadong
    Cheng, Shuli
    Du, Anyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 12032 - 12047
  • [34] Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Li, Yidong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8432 - 8444
  • [35] Tri-modality consistency optimization with heterogeneous augmented images for visible-infrared person re-identification
    Si, Tongzhen
    He, Fazhi
    Li, Penglei
    Gao, Xiaoxin
    NEUROCOMPUTING, 2023, 523 : 170 - 181
  • [36] Occluded Visible-Infrared Person Re-Identification
    Feng, Yujian
    Ji, Yimu
    Wu, Fei
    Gao, Guangwei
    Gao, Yang
    Liu, Tianliang
    Liu, Shangdong
    Jing, Xiao-Yuan
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1401 - 1413
  • [37] Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification
    Liu, Haojie
    Xia, Daoxun
    Jiang, Wei
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2023, 17 (03) : 545 - 559
  • [38] Learning Progressive Modality-Shared Transformers for Effective Visible-Infrared Person Re-identification
    Lu, Hu
    Zou, Xuezhang
    Zhang, Pingping
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1835 - 1843
  • [39] Adversarial Decoupling and Modality-Invariant Representation Learning for Visible-Infrared Person Re-Identification
    Hu, Weipeng
    Liu, Bohong
    Zeng, Haitang
    Hou, Yanke
    Hu, Haifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5095 - 5109
  • [40] Visible-Infrared Person Re-identification via Modality Augmentation and Center Constraints
    Chen, Qiang
    Xiao, Guoqiang
    Wu, Jiahao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 221 - 232