PMG-Pyramidal Multi-Granular Matching for Text-Based Person Re-Identification

被引:1
|
作者
Liu, Chao [1 ]
Xue, Jingyi [2 ]
Wang, Zijie [2 ]
Zhu, Aichun [2 ]
机构
[1] Jinling Inst Technol, Sch Intelligent Sci & Control Engn, Nanjing 211199, Peoples R China
[2] Nanjing Tech Univ, Sch Comp Sci & Technol, Nanjing 211816, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 21期
关键词
text-based person retrieval; person re-identification; multi-granular matching;
D O I
10.3390/app132111876
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Given a textual query, text-based person re-identification is supposed to search for the targeted pedestrian images from a large-scale visual database. Due to the inherent heterogeneity between different modalities, it is challenging to measure the cross-modal affinity between visual and textual data. Existing works typically employ single-granular methods to extract local features and align image regions with relevant words/phrases. Nevertheless, the limited robustness of single-granular methods cannot adapt to the imprecision and variances of visual and textual features, which are usually influenced by the background clutter, position transformation, posture diversity, and occlusion in surveillance videos, thereby leading to the deterioration of cross-modal matching accuracy. In this paper, we propose a Pyramidal Multi-Granular matching network (PMG) that incorporates a gradual transition process between the coarsest global information and the finest local information by a coarse-to-fine pyramidal method for multi-granular cross-modal features extraction and affinities learning. For each body part of a pedestrian, PMG is adequate in ensuring the integrity of local information while minimizing the surrounding interference signals at a certain scale and can adapt to capture discriminative signals of different body parts and achieve semantically alignment between image strips with relevant textual descriptions, thus suppressing the variances of feature extraction and improving the robustness of feature matching. Comprehensive experiments are conducted on the CUHK-PEDES and RSTPReid datasets to validate the effectiveness of the proposed method and results show that PMG outperforms state-of-the-art (SOTA) methods significantly and yields competitive accuracy of cross-modal retrieval.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Relation network based on multi-granular hypergraphs for person re-identification
    Guo, Chenchen
    Zhao, Xiaoming
    Zou, Qiang
    APPLIED INTELLIGENCE, 2022, 52 (10) : 11394 - 11406
  • [2] Relation network based on multi-granular hypergraphs for person re-identification
    Chenchen Guo
    Xiaoming Zhao
    Qiang Zou
    Applied Intelligence, 2022, 52 : 11394 - 11406
  • [3] Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
    Yan, Yichao
    Qin, Jie
    Chen, Jiaxin
    Liu, Li
    Zhu, Fan
    Tai, Ying
    Shao, Ling
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2896 - 2905
  • [4] BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
    Fujii, Takuro
    Tarashima, Shuhei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2778 - 2782
  • [5] Weakly Supervised Text-based Person Re-Identification
    Zhao, Shizhen
    Gao, Changxin
    Shao, Yuanjie
    Zheng, Wei-Shi
    Sang, Nong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11375 - 11384
  • [6] Decentralized Text-Based Person Re-Identification in Multi-Camera Networks
    Agyeman, Rockson
    Rinner, Bernhard
    IEEE ACCESS, 2024, 12 : 172125 - 172148
  • [7] Parallel Data Augmentation for Text-based Person Re-identification
    Cai, Han-Qing
    Li, Xin
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [8] MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION
    Xu, Wenhao
    Shao, Zhiyin
    Ding, Changxing
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1680 - 1684
  • [9] MSTN: A Multi-granular Spatial-Temporal Network for video-based person re-identification
    Zhao, Wei
    Zhang, Bo
    Yang, Cong
    Chen, Xianfu
    Chen, Hui
    INTERNET OF THINGS, 2022, 20
  • [10] AMEN: Adversarial Multi-space Embedding Network for Text-Based Person Re-identification
    Wang, Zijie
    Xue, Jingyi
    Zhu, Aichun
    Li, Yifeng
    Zhang, Mingyi
    Zhong, Chongliang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 462 - 473