TIPCB: A simple but effective part-based convolutional baseline for text-based person search

被引:49
|
作者
Chen, Yuhao [1 ]
Zhang, Guoqing [1 ]
Lu, Yujiang [1 ]
Wang, Zhenxing [2 ]
Zheng, Yuhui [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Math & Stat, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modality; Person search; Local representation; NETWORK; REIDENTIFICATION;
D O I
10.1016/j.neucom.2022.04.081
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-based person search is a sub-task in the field of image retrieval, which aims to retrieve target person images according to a given textual description. The significant feature gap between two modalities makes this task very challenging. Many existing methods attempt to utilize local alignment to address this problem in the fine-grained level. However, most relevant methods introduce additional models or complicated training and evaluation strategies, which are hard to use in realistic scenarios. In order to facilitate the practical application, we propose a simple but effective baseline for text-based person search named TIPCB (i.e., Text-Image Part-based Convolutional Baseline). Firstly, a novel dual-path local alignment network structure is proposed to extract visual and textual local representations, in which images are segmented horizontally and texts are aligned adaptively. Then, we propose a multi-stage cross-modal matching strategy, which eliminates the modality gap from three feature levels, including low level, local level and global level. Extensive experiments are conducted on the widely-used benchmark datasets (CUHK-PEDES and ICFG-PEDES) and verify that our method outperforms all the existing methods. Our code has been released in https://github.com/OrangeYHChen/TIPCB. (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:171 / 181
页数:11
相关论文
共 50 条
  • [31] LEARNING SEMANTIC-ALIGNED FEATURE REPRESENTATION FOR TEXT-BASED PERSON SEARCH
    Li, Shiping
    Cao, Min
    Zhang, Min
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2724 - 2728
  • [32] Text-based Person Search via Multi-Granularity Embedding Learning
    Wang, Chengji
    Luo, Zhiming
    Lin, Yaojin
    Li, Shaozi
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1068 - 1074
  • [33] Text-based person search via cross-modal alignment learning
    Ke, Xiao
    Liu, Hao
    Xu, Peirong
    Lin, Xinru
    Guo, Wenzhong
    PATTERN RECOGNITION, 2024, 152
  • [34] RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search
    Bai, Yang
    Cao, Min
    Gao, Daming
    Cao, Ziqiang
    Chen, Chen
    Fan, Zhenfeng
    Nie, Liqiang
    Zhang, Min
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 555 - 563
  • [35] Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search
    Liu, Jiawei
    Zha, Zheng-Jun
    Hong, Richang
    Wang, Meng
    Zhang, Yongdong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 665 - 673
  • [36] Improving embedding learning by virtual attribute decoupling for text-based person search
    Chengji Wang
    Zhiming Luo
    Yaojin Lin
    Shaozi Li
    Neural Computing and Applications, 2022, 34 : 5625 - 5647
  • [37] PH-GCN: Person Retrieval With Part-Based Hierarchical Graph Convolutional Network
    Jiang, Bo
    Wang, Xixi
    Zheng, Aihua
    Tang, Jin
    Luo, Bin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3218 - 3228
  • [38] Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search
    You, Kaiyang
    Chen, Wenjing
    Wang, Chengji
    Sun, Hao
    Xie, Wei
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2230 - 2234
  • [39] Text-based person search by non-saliency enhancing and dynamic label smoothing
    Pang Y.
    Zhang C.
    Li Z.
    Wei C.
    Wang Z.
    Neural Computing and Applications, 2024, 36 (21) : 13327 - 13339
  • [40] Relation-aware aggregation network with auxiliary guidance for text-based person search
    Zeng, Pengpeng
    Jing, Shuaiqi
    Song, Jingkuan
    Fan, Kaixuan
    Li, Xiangpeng
    We, Liansuo
    Guo, Yuan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): : 1565 - 1582