Joint discriminative representation learning for end-to-end person search

被引：17

作者：

Zhang, Pengcheng ^{[1
]}

Yu, Xiaohan ^{[2
,3
]}

Bai, Xiao ^{[1
]}

Wang, Chen ^{[1
]}

Zheng, Jin ^{[1
]}

Ning, Xin ^{[4
]}

机构：

[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China

[2] Macquarie Univ, Sch Comp, Sydney, Australia

[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia

[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 147卷

基金：

美国国家科学基金会;

关键词：

Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;

D O I：

10.1016/j.patcog.2023.110053

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1

引用

页数：11

共 50 条

[21] DTHN: Dual-Transformer Head End-to-End Person Search Network
Feng, Cheng
Han, Dezhi
Chen, Chongqing
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 245 - 261
[22] Fully Decoupled End-to-End Person Search: An Approach without Conflicting Objectives
Zhang, Pengcheng
Yu, Xiaohan
Bai, Xiao
Zheng, Jin
Ning, Xin
Hancock, Edwin R.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
[23] Discriminative Frequency Information Learning for End-to-End Speech Anti-Spoofing
Huang, Bingyuan
Cui, Sanshuai
Huang, Jiwu
Kang, Xiangui
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 185 - 189
[24] Associative Embedding: End-to-End Learning for Joint Detection and Grouping
Newell, Alejandro
Huang, Zhiao
Deng, Jia
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[25] Joint End-to-End Learning for Scale-adaptive Person Super-resolution and Re-identification
Zhong, Yan-Zhen
Shao, Wen-Ze
Ge, Qi
Wang, Li-Qian
Xie, Shi-Peng
Xu, Juan
Li, Hai-Bo
ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
[26] End-to-End Learning of Joint Geometric and Probabilistic Constellation Shaping
Aref, Vahid
Chagnon, Mathieu
2022 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2022,
[27] Representation Online Maters: Practical End-to-End Diversification in Search and Recommender Systems
Silva, Pedro
Juneja, Bhawna
Desai, Shloka
Singh, Ashudeep
Fawaz, Nadia
PROCEEDINGS OF THE 6TH ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2023, 2023, : 1735 - 1746
[28] Spatial and temporal learning representation for end-to-end recording device identification
Chunyan Zeng
Dongliang Zhu
Zhifeng Wang
Minghu Wu
Wei Xiong
Nan Zhao
EURASIP Journal on Advances in Signal Processing, 2021
[29] End-to-End Representation Learning for Chemical-Chemical Interaction Prediction
Kwon, Sunyoung
Yoon, Sungroh
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (05) : 1436 - 1447
[30] ACTIVEMATCH: END-TO-END SEMI-SUPERVISED ACTIVE REPRESENTATION LEARNING
Yuan, Xinkai
Li, Zilinghan
Wang, Gaoang
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1136 - 1140

← 1 2 3 4 5 →