Joint discriminative representation learning for end-to-end person search

被引:17
|
作者
Zhang, Pengcheng [1 ]
Yu, Xiaohan [2 ,3 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Zheng, Jin [1 ]
Ning, Xin [4 ]
机构
[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Macquarie Univ, Sch Comp, Sydney, Australia
[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia
[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;
D O I
10.1016/j.patcog.2023.110053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1
引用
收藏
页数:11
相关论文
共 50 条
  • [21] DTHN: Dual-Transformer Head End-to-End Person Search Network
    Feng, Cheng
    Han, Dezhi
    Chen, Chongqing
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 245 - 261
  • [22] Fully Decoupled End-to-End Person Search: An Approach without Conflicting Objectives
    Zhang, Pengcheng
    Yu, Xiaohan
    Bai, Xiao
    Zheng, Jin
    Ning, Xin
    Hancock, Edwin R.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [23] Discriminative Frequency Information Learning for End-to-End Speech Anti-Spoofing
    Huang, Bingyuan
    Cui, Sanshuai
    Huang, Jiwu
    Kang, Xiangui
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 185 - 189
  • [24] Associative Embedding: End-to-End Learning for Joint Detection and Grouping
    Newell, Alejandro
    Huang, Zhiao
    Deng, Jia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [25] Joint End-to-End Learning for Scale-adaptive Person Super-resolution and Re-identification
    Zhong, Yan-Zhen
    Shao, Wen-Ze
    Ge, Qi
    Wang, Li-Qian
    Xie, Shi-Peng
    Xu, Juan
    Li, Hai-Bo
    ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [26] End-to-End Learning of Joint Geometric and Probabilistic Constellation Shaping
    Aref, Vahid
    Chagnon, Mathieu
    2022 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2022,
  • [27] Representation Online Maters: Practical End-to-End Diversification in Search and Recommender Systems
    Silva, Pedro
    Juneja, Bhawna
    Desai, Shloka
    Singh, Ashudeep
    Fawaz, Nadia
    PROCEEDINGS OF THE 6TH ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2023, 2023, : 1735 - 1746
  • [28] Spatial and temporal learning representation for end-to-end recording device identification
    Chunyan Zeng
    Dongliang Zhu
    Zhifeng Wang
    Minghu Wu
    Wei Xiong
    Nan Zhao
    EURASIP Journal on Advances in Signal Processing, 2021
  • [29] End-to-End Representation Learning for Chemical-Chemical Interaction Prediction
    Kwon, Sunyoung
    Yoon, Sungroh
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (05) : 1436 - 1447
  • [30] ACTIVEMATCH: END-TO-END SEMI-SUPERVISED ACTIVE REPRESENTATION LEARNING
    Yuan, Xinkai
    Li, Zilinghan
    Wang, Gaoang
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1136 - 1140