Deep Graph Metric Learning for Weakly Supervised Person Re-Identification

Cited by: 14
Authors
Meng, Jingke [1, 2]
Zheng, Wei-Shi [3, 4]
Lai, Jian-Huang [1]
Wang, Liang [5]
Affiliations
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 519082, Peoples R China
[2] Pazhou Lab, Guangzhou 519082, Peoples R China
[3] Sun Yat Sen Univ, Sch Comp Sci & Engn, Key Lab Machine Intelligence & Adv Comp, Minist Educ, Guangzhou 519082, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[5] Chinese Acad Sci, Inst Automat, Beijing 100049, Peoples R China
Keywords
Training; Cameras; Labeling; Probes; Visualization; Annotations; Loss measurement; Person re-identification; weakly supervised person re-identification; visual surveillance;
DOI
10.1109/TPAMI.2021.3084613
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In conventional person re-identification (re-id), the images used for model training in the training probe set and training gallery set are all assumed to be instance-level samples that are manually labeled from raw surveillance video (likely with the assistance of detection) in a frame-by-frame manner. This labeling across multiple non-overlapping camera views from raw video surveillance is expensive and time consuming. To overcome these issues, we consider a weakly supervised person re-id modeling that aims to find the raw video clips where a given target person appears. In our weakly supervised setting, during training, given a sample of a person captured in one camera view, our weakly supervised approach aims to train a re-id model without further instance-level labeling for this person in another camera view. The weak setting refers to matching a target person with an untrimmed gallery video where we only know that the identity appears in the video without the requirement of annotating the identity in any frame of the video during the training procedure. The weakly supervised person re-id is challenging since it not only suffers from the difficulties occurring in conventional person re-id (e.g., visual ambiguity and appearance variations caused by occlusions, pose variations, background clutter, etc.), but more importantly, is also challenged by weakly supervised information because the instance-level labels and the ground-truth locations for person instances (i.e., the ground-truth bounding boxes of person instances) are absent. To solve the weakly supervised person re-id problem, we develop deep graph metric learning (DGML). On the one hand, DGML measures the consistency between intra-video spatial graphs of consecutive frames, where the spatial graph captures neighborhood relationship about the detected person instances in each frame. 
On the other hand, DGML distinguishes the inter-video spatial graphs captured from different camera views at different sites simultaneously. To further explicitly embed weak supervision into the DGML and solve the weakly supervised person re-id problem, we introduce weakly supervised regularization (WSR), which utilizes multiple weak video-level labels to learn discriminative features by means of a weak identity loss and a cross-video alignment loss. We conduct extensive experiments to demonstrate the feasibility of the weakly supervised person re-id approach and its special cases (e.g., its bag-to-bag extension) and show that the proposed DGML is effective.
Pages: 6074-6093
Number of pages: 20