Text-augmented Multi-Modality contrastive learning for unsupervised visible-infrared person re-identification

被引：0

作者：

Sun, Rui ^{[1
,2
,3
]}

Huang, Guoxi ^{[1
,2
]}

Wang, Xuebin ^{[1
,2
]}

Du, Yun ^{[1
,2
]}

Zhang, Xudong ^{[1
,2
,3
]}

机构：

[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, 485 Danxia Rd, Hefei 230009, Peoples R China

[2] Hefei Univ Technol, Anhui Key Lab Ind Safety & Emergency Technol, Hefei 230009, Peoples R China

[3] Minist Educ Peoples Republ China, Key Lab Knowledge Engn Big Data, Hefei 230009, Peoples R China

来源：

IMAGE AND VISION COMPUTING | 2024年 / 152卷

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Unsupervised person re-identification; Text-augmented features; Multi-Modality; Contrastive learning;

D O I：

10.1016/j.imavis.2024.105310

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visible-infrared person re-identification holds significant implications for intelligent security. Unsupervised methods can reduce the gap of different modalities without labels. Most previous unsupervised methods only train their models with image information, so that the model cannot obtain powerful deep semantic information. In this paper, we leverage CLIP to extract deep text information. We propose a Text-Image Alignment (TIA) module to align the image and text information and effectively bridge the gap between visible and infrared modality. We produce a Local-Global Image Match (LGIM) module to find homogeneous information. Specifically, we employ the Hungarian algorithm and Simulated Annealing (SA) algorithm to attain original information from image features while mitigating the interference of heterogeneous information. Additionally, we design a Changeable Cross-modality Alignment Loss (CCAL) to enable the model to learn modality-specific features during different training stages. Our method performs well and attains powerful robustness by targeted learning. Extensive experiments demonstrate the effectiveness of our approach, our method achieves a rank-1 accuracy that exceeds state-of-the-art approaches by approximately 10% on the RegDB.

引用

页数：9

共 50 条

[1] Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification
Yang, Bin
Ye, Mang
Chen, Jun
Wu, Zesen
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2843 - 2851
[2] Two-stage contrastive learning for unsupervised visible-infrared person re-identification
Zou, Yuan
Zhu, Pengxu
Yang, Jianwei
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
[3] Image-text feature learning for unsupervised visible-infrared person re-identification
Guo, Jifeng
Pang, Zhiqi
IMAGE AND VISION COMPUTING, 2025, 158
[4] Contrastive Learning with Information Compensation for Visible-Infrared Person Re-Identification
Zhang, La
Guo, Haiyun
Zhao, Xu
Sun, Jian
Wang, Jinqiao
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1266 - 1271
[5] Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification
Zhang, La
Guo, Haiyun
Zhu, Kuan
Qiao, Honglin
Huang, Gaopan
Zhang, Sen
Zhang, Huichen
Sun, Jian
Wang, Jinqiao
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
[6] Robust Duality Learning for Unsupervised Visible-Infrared Person Re-Identification
Li, Yongxiang
Sun, Yuan
Qin, Yang
Peng, Dezhong
Peng, Xi
Hu, Peng
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1937 - 1948
[7] Beyond a strong baseline: cross-modality contrastive learning for visible-infrared person re-identification
Pengfei Fang
Yukang Zhang
Zhenzhong Lan
Machine Vision and Applications, 2023, 34
[8] Beyond a strong baseline: cross-modality contrastive learning for visible-infrared person re-identification
Fang, Pengfei
Zhang, Yukang
Lan, Zhenzhong
MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
[9] Modality-Shared Prototypes for Enhanced Unsupervised Visible-Infrared Person Re-Identification
Chen, Xiaohan
Wang, Suqing
Zheng, Yujin
PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 237 - 250
[10] Modality Unifying Network for Visible-Infrared Person Re-Identification
Yu, Hao
Cheng, Xu
Peng, Wei
Liu, Weihao
Zhao, Guoying
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11151 - 11161

← 1 2 3 4 5 →