An Overview of Text-based Person Search: Recent Advances and Future Directions

Citations: 0
Authors
Niu K. [1]
Liu Y. [1]
Long Y. [1]
Huang Y. [3]
Wang L. [3]
Zhang Y. [1]
Affiliations
[1] Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Beijing
Funding
National Natural Science Foundation of China
Keywords
Benchmark testing; Cross-modal retrieval; Feature extraction; Pedestrians; Semantic alignments; Semantics; Task analysis; Text-based person search; Training; Video surveillance; Visualization
DOI
10.1109/TCSVT.2024.3376373
Abstract
Owing to its practical significance in smart video surveillance systems, Text-Based Person Search (TBPS), which refers to searching for pedestrian images of interest given natural language sentences, has recently become a research hotspot. To help researchers quickly grasp the developments of this important task, we comprehensively summarize the recent research advances in TBPS from two perspectives, i.e., Feature Extraction (FE) and Semantic Alignments (SA). Specifically, FE mainly consists of pre-processing approaches and end-to-end frameworks, and SA can be broadly divided into cross-modal attention mechanisms, non-attention alignments, training objectives, and generative approaches. Afterwards, we elaborate on four widely used benchmarks and the evaluation criterion for TBPS. Comparisons and analyses of the state-of-the-art (SOTA) solutions are provided based on these large-scale benchmarks. Finally, we point out some future research directions that need to be further addressed, which will greatly facilitate the practical applications of TBPS.
Pages: 1 / 1