An Overview of Text-based Person Search: Recent Advances and Future Directions

Authors
Niu K. [1]
Liu Y. [1]
Long Y. [1]
Huang Y. [3]
Wang L. [3]
Zhang Y. [1]
Affiliations
[1] Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Beijing
Funding
National Natural Science Foundation of China
Keywords
Benchmark testing; cross-modal retrieval; feature extraction; Pedestrians; semantic alignments; Semantics; Task analysis; Text-based person search; Training; video surveillance; Visualization
DOI
10.1109/TCSVT.2024.3376373
Abstract
Owing to its practical significance in smart video surveillance systems, Text-Based Person Search (TBPS), which refers to searching for pedestrian images of interest given natural language sentences, has recently become a research hotspot. To help researchers quickly grasp the developments of this important task, we comprehensively summarize the recent research advances of TBPS from two perspectives, i.e., Feature Extraction (FE) and Semantic Alignments (SA). Specifically, FE mainly consists of pre-processing approaches and end-to-end frameworks, while SA can be roughly divided into cross-modal attention mechanisms, non-attention alignments, training objectives, and generative approaches. Afterwards, we describe four widely used benchmarks and the evaluation criterion for TBPS, and provide comparisons and analyses of state-of-the-art (SOTA) solutions on these large-scale benchmarks. Finally, we point out some future research directions that need to be further addressed, which will greatly facilitate the practical applications of TBPS.
Pages: 1 / 1
Related papers
50 in total
  • [1] An Empirical Study of CLIP for Text-Based Person Search
    Cao, Min
    Bai, Yang
    Zeng, Ziyin
    Ye, Mang
    Zhang, Min
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 465 - 473
  • [2] Diverse Person: Customize Your Own Dataset for Text-Based Person Search
    Song, Zifan
    Hu, Guosheng
    Zhao, Cairong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4943 - 4951
  • [3] Text-based Person Search via Virtual Attribute Learning
    Wang C.-J.
    Su J.-W.
    Luo Z.-M.
    Cao D.-L.
    Lin Y.-J.
    Li S.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): 2035 - 2050
  • [4] Local-enhanced representation for text-based person search
    Zhang, Guoqing
    Chen, Yuhao
    Zheng, Yuhui
    Martin, Gaven
    Wang, Ruili
    Pattern Recognition, 2025, 161
  • [5] Hierarchical Gumbel Attention Network for Text-based Person Search
    Zheng, Kecheng
    Liu, Wu
    Liu, Jiawei
    Zha, Zheng-Jun
    Mei, Tao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3441 - 3449
  • [6] Noise correspondence with evidence learning for text-based person search
    Yihan Xie
    Baohua Zhang
    Yang Li
    Chongrui Shan
    Shun Wang
    Jiale Zhang
    The Journal of Supercomputing, 81 (5)
  • [7] An Adaptive Correlation Filtering Method for Text-Based Person Search
    Sun, Mengyang
    Suo, Wei
    Wang, Peng
    Niu, Kai
    Liu, Le
    Lin, Guosheng
    Zhang, Yanning
    Wu, Qi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 4440 - 4455
  • [8] Sponsored Search Auctions: Recent Advances and Future Directions
    Qin, Tao
    Chen, Wei
    Liu, Tie-Yan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 5 (04)
  • [9] Conditional Feature Learning Based Transformer for Text-Based Person Search
    Gao, Chenyang
    Cai, Guanyu
    Jiang, Xinyang
    Zheng, Feng
    Zhang, Jun
    Gong, Yifei
    Lin, Fangzhou
    Sun, Xing
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6097 - 6108
  • [10] Text-based Person Search without Parallel Image-Text Data
    Bai, Yang
    Wang, Jingyao
    Cao, Min
    Chen, Chen
    Cao, Ziqiang
    Nie, Liqiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 757 - 767