TriMatch: Triple Matching for Text-to-Image Person Re-Identification

Authors
Yan, Shuanglin [1]
Dong, Neng [1]
Li, Shuang [2]
Li, Huafeng [3]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
[3] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Visualization; Text to image; Accuracy; Identification of persons; Vectors; Tuning; Transforms; Training; Head; Text-to-image person re-identification; heterogeneous gaps; cross-modal matching; unimodal matching; NETWORK;
DOI
10.1109/LSP.2025.3534689
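Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification codes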
0808 ; 0809 ;
Abstract
Text-to-image person re-identification (TIReID) is a cross-modal retrieval task that aims to retrieve target person images based on a given text description. Existing methods primarily focus on mining the semantic associations across modalities, relying on the matching between heterogeneous features for retrieval. However, due to the inherent heterogeneous gaps between modalities, it is challenging to establish precise semantic associations, particularly in fine-grained correspondences, often leading to incorrect retrieval results. To address this issue, this letter proposes an innovative Triple Matching (TriMatch) framework that integrates cross-modal (image-text) matching and unimodal (image-image, text-text) matching for high-precision person retrieval. The framework introduces a generation task that performs cross-modal (image-to-text and text-to-image) feature generation and intra-modal feature alignment to achieve unimodal matching. By incorporating the generation task, TriMatch considers not only the semantic correlations between modalities but also the semantic consistency within single modalities, thereby effectively enhancing the accuracy of target person retrieval. Extensive experiments on multiple datasets demonstrate the superiority of TriMatch over existing methods.
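As a rough illustration of the triple-matching idea described in the abstract, the sketch below scores a text query against gallery images using one cross-modal similarity (text vs. image) and two unimodal similarities (text-generated image feature vs. real image feature, and text feature vs. image-generated text feature). The abstract does not say how the three scores are combined or how the features are generated, so the cosine similarity, the weighted average, and every name here (`cosine`, `triple_score`, `rank_gallery`, `weights`) are assumptions made for illustration, not the authors' method.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def triple_score(text_feat, image_feat, gen_image_feat, gen_text_feat,
                 weights=(1.0, 1.0, 1.0)):
    """Combine one cross-modal and two unimodal similarities.

    gen_image_feat: image-space feature generated from the text (hypothetical input).
    gen_text_feat:  text-space feature generated from the image (hypothetical input).
    The weighted average is an assumption; the abstract gives no fusion rule.
    """
    w_cross, w_img, w_txt = weights
    cross = cosine(text_feat, image_feat)          # image-text matching
    img_img = cosine(gen_image_feat, image_feat)   # image-image matching
    txt_txt = cosine(text_feat, gen_text_feat)     # text-text matching
    total = w_cross + w_img + w_txt
    return (w_cross * cross + w_img * img_img + w_txt * txt_txt) / total


def rank_gallery(text_feat, gen_image_feat, gallery, weights=(1.0, 1.0, 1.0)):
    """Return gallery indices sorted from best to worst match.

    gallery is a list of (image_feat, gen_text_feat) pairs.
    """
    scores = [
        triple_score(text_feat, img, gen_image_feat, gen_txt, weights)
        for img, gen_txt in gallery
    ]
    return sorted(range(len(gallery)), key=lambda i: scores[i], reverse=True)


# Toy usage with 2-D features standing in for learned embeddings.
query_text = [1.0, 0.0]
query_gen_image = [1.0, 0.0]
gallery = [([0.0, 1.0], [0.0, 1.0]), ([1.0, 0.0], [1.0, 0.0])]
ranking = rank_gallery(query_text, query_gen_image, gallery)
```

In this toy setup the second gallery entry agrees with the query in all three comparisons, so it is ranked first; in practice the features would come from trained encoders and generators, which the abstract does not detail.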
Pages: 806-810
Number of pages: 5