PMG-Pyramidal Multi-Granular Matching for Text-Based Person Re-Identification

被引:1
|
作者
Liu, Chao [1 ]
Xue, Jingyi [2 ]
Wang, Zijie [2 ]
Zhu, Aichun [2 ]
机构
[1] Jinling Inst Technol, Sch Intelligent Sci & Control Engn, Nanjing 211199, Peoples R China
[2] Nanjing Tech Univ, Sch Comp Sci & Technol, Nanjing 211816, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 21期
关键词
text-based person retrieval; person re-identification; multi-granular matching;
D O I
10.3390/app132111876
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Given a textual query, text-based person re-identification is supposed to search for the targeted pedestrian images from a large-scale visual database. Due to the inherent heterogeneity between different modalities, it is challenging to measure the cross-modal affinity between visual and textual data. Existing works typically employ single-granular methods to extract local features and align image regions with relevant words/phrases. Nevertheless, the limited robustness of single-granular methods cannot adapt to the imprecision and variances of visual and textual features, which are usually influenced by the background clutter, position transformation, posture diversity, and occlusion in surveillance videos, thereby leading to the deterioration of cross-modal matching accuracy. In this paper, we propose a Pyramidal Multi-Granular matching network (PMG) that incorporates a gradual transition process between the coarsest global information and the finest local information by a coarse-to-fine pyramidal method for multi-granular cross-modal features extraction and affinities learning. For each body part of a pedestrian, PMG is adequate in ensuring the integrity of local information while minimizing the surrounding interference signals at a certain scale and can adapt to capture discriminative signals of different body parts and achieve semantically alignment between image strips with relevant textual descriptions, thus suppressing the variances of feature extraction and improving the robustness of feature matching. Comprehensive experiments are conducted on the CUHK-PEDES and RSTPReid datasets to validate the effectiveness of the proposed method and results show that PMG outperforms state-of-the-art (SOTA) methods significantly and yields competitive accuracy of cross-modal retrieval.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Part Matching with Multi-level Attention for Person Re-Identification
    Wang, Jiaze
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1805 - 1814
  • [32] Text Based Unsupervised Domain Generalization Person Re-identification
    Zhang, Guoqing
    Jin, Tong
    Liu, Tianqi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XV, 2025, 15045 : 377 - 391
  • [33] Multi-Granularity Matching Transformer for Text-Based Person Search
    Bao, Liping
    Wei, Longhui
    Zhou, Wengang
    Liu, Lin
    Xie, Lingxi
    Li, Houqiang
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4281 - 4293
  • [34] Prototype-guided Cross-modal Completion and Alignment for Incomplete Text-based Person Re-identification
    Gong, Tiantian
    Du, Guodong
    Wang, Junsheng
    Ding, Yongkang
    Zhang, Liyan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5253 - 5261
  • [35] Horizontal Pyramid Matching for Person Re-Identification
    Fu, Yang
    Wei, Yunchao
    Zhou, Yuqian
    Shi, Honghui
    Huang, Gao
    Wang, Xinchao
    Yao, Zhiqiang
    Huang, Thomas
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8295 - 8302
  • [36] Sparse representation matching for person re-identification
    An, Le
    Chen, Xiaojing
    Yang, Songfan
    Bhanu, Bir
    INFORMATION SCIENCES, 2016, 355 : 74 - 89
  • [37] Person re-identification by unsupervised video matching
    Ma, Xiaolong
    Zhu, Xiatian
    Gong, Shaogang
    Xie, Xudong
    Hu, Jianming
    Lam, Kin-Man
    Zhong, Yisheng
    PATTERN RECOGNITION, 2017, 65 : 197 - 210
  • [38] A Text-Based Dual-Branch Person Re-Identification Algorithm Based on the Deep Attribute Information Mining Network
    Han, Ke
    Zhang, Xiyan
    Xu, Wenlong
    Jin, Long
    SYMMETRY-BASEL, 2025, 17 (01):
  • [39] Multi-signature based person re-identification
    Martinel, N.
    Foresti, G. L.
    ELECTRONICS LETTERS, 2012, 48 (13) : 765 - 767
  • [40] Multi-Scale Transformer-Based Matching Network for Generalizable Person Re-Identification
    Jiang, Jinhua
    Zhang, Wenfeng
    Ran, Ruisheng
    Hu, Wei
    Dai, Jiangyan
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 (1277-1281) : 1277 - 1281