PMG-Pyramidal Multi-Granular Matching for Text-Based Person Re-Identification

Cited by: 1
Authors
Liu, Chao [1]
Xue, Jingyi [2]
Wang, Zijie [2]
Zhu, Aichun [2]
Affiliations
[1] Jinling Inst Technol, Sch Intelligent Sci & Control Engn, Nanjing 211199, Peoples R China
[2] Nanjing Tech Univ, Sch Comp Sci & Technol, Nanjing 211816, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 21
Keywords
text-based person retrieval; person re-identification; multi-granular matching
DOI
10.3390/app132111876
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Given a textual query, text-based person re-identification aims to retrieve the target pedestrian images from a large-scale visual database. Because of the inherent heterogeneity between modalities, measuring the cross-modal affinity between visual and textual data is challenging. Existing works typically employ single-granular methods to extract local features and align image regions with relevant words or phrases. However, single-granular methods have limited robustness and cannot adapt to the imprecision and variance of visual and textual features, which are usually affected by background clutter, position transformation, posture diversity, and occlusion in surveillance videos, and this degrades cross-modal matching accuracy. In this paper, we propose a Pyramidal Multi-Granular matching network (PMG) that uses a coarse-to-fine pyramidal method for multi-granular cross-modal feature extraction and affinity learning, providing a gradual transition between the coarsest global information and the finest local information. For each body part of a pedestrian, PMG preserves the integrity of local information at a certain scale while minimizing surrounding interference signals. It can also adapt to capture discriminative signals of different body parts and achieve semantic alignment between image strips and relevant textual descriptions, thus suppressing the variance of feature extraction and improving the robustness of feature matching. Comprehensive experiments on the CUHK-PEDES and RSTPReid datasets validate the effectiveness of the proposed method, and the results show that PMG significantly outperforms state-of-the-art (SOTA) methods and yields competitive cross-modal retrieval accuracy.
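The coarse-to-fine idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how strips are partitioned or how affinities are computed, so the equal-height horizontal split, the mean pooling, the cosine similarity, and the names `pyramid_strips` and `best_strip_per_level` are all assumptions made for illustration only.

```python
import math
from typing import List

Vector = List[float]


def mean_pool(rows: List[Vector]) -> Vector:
    """Average equal-length row features into a single vector."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def pyramid_strips(row_feats: List[Vector], levels: int) -> List[List[Vector]]:
    """Build a coarse-to-fine pyramid (assumed scheme, not from the paper).

    Level k (1..levels) splits the image rows into k horizontal strips
    and mean-pools each strip. Level 1 is the global feature.
    """
    h = len(row_feats)
    if levels < 1 or h < levels:
        raise ValueError("need at least one row per strip at the finest level")
    pyramid = []
    for k in range(1, levels + 1):
        # Integer boundaries so every strip holds at least one row.
        bounds = [(i * h) // k for i in range(k + 1)]
        pyramid.append(
            [mean_pool(row_feats[bounds[i]:bounds[i + 1]]) for i in range(k)]
        )
    return pyramid


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def best_strip_per_level(pyramid: List[List[Vector]], text_vec: Vector) -> List[int]:
    """For each level, index of the strip most similar to a text feature."""
    return [
        max(range(len(level)), key=lambda i: cosine(level[i], text_vec))
        for level in pyramid
    ]


# Toy input: 6 image rows with hypothetical 2-dimensional features.
rows = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 1.0], [1.0, 1.0]]
pyramid = pyramid_strips(rows, levels=3)
matches = best_strip_per_level(pyramid, text_vec=[0.0, 1.0])
```

In this toy setup the finest level keeps each two-row strip intact, so a text feature that resembles the middle rows is matched to the middle strip, while the coarser levels dilute it with neighbouring rows. This is only meant to convey why several granularities might be combined.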
Pages: 18