Deep Graph Metric Learning for Weakly Supervised Person Re-Identification

被引:14
|
作者
Meng, Jingke [1 ,2 ]
Zheng, Wei-Shi [3 ,4 ]
Lai, Jian-Huang [1 ]
Wang, Liang [5 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 519082, Peoples R China
[2] Pazhou Lab, Guangzhou 519082, Peoples R China
[3] Sun Yat Sen Univ, Sch Comp Sci & Engn, Key Lab Machine Intelligence & Adv Comp, Minist Educ, Guangzhou 519082, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[5] Chinese Acad Sci, Inst Automat, Beijing 100049, Peoples R China
关键词
Training; Cameras; Labeling; Probes; Visualization; Annotations; Loss measurement; Person re-identification; weakly supervised person re-identification; visual surveillance;
D O I
10.1109/TPAMI.2021.3084613
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In conventional person re-identification (re-id), the images used for model training in the training probe set and training gallery set are all assumed to be instance-level samples that are manually labeled from raw surveillance video (likely with the assistance of detection) in a frame-by-frame manner. This labeling across multiple non-overlapping camera views from raw video surveillance is expensive and time consuming. To overcome these issues, we consider a weakly supervised person re-id modeling that aims to find the raw video clips where a given target person appears. In our weakly supervised setting, during training, given a sample of a person captured in one camera view, our weakly supervised approach aims to train a re-id model without further instance-level labeling for this person in another camera view. The weak setting refers to matching a target person with an untrimmed gallery video where we only know that the identity appears in the video without the requirement of annotating the identity in any frame of the video during the training procedure. The weakly supervised person re-id is challenging since it not only suffers from the difficulties occurring in conventional person re-id (e.g., visual ambiguity and appearance variations caused by occlusions, pose variations, background clutter, etc.), but more importantly, is also challenged by weakly supervised information because the instance-level labels and the ground-truth locations for person instances (i.e., the ground-truth bounding boxes of person instances) are absent. To solve the weakly supervised person re-id problem, we develop deep graph metric learning (DGML). On the one hand, DGML measures the consistency between intra-video spatial graphs of consecutive frames, where the spatial graph captures neighborhood relationship about the detected person instances in each frame. On the other hand, DGML distinguishes the inter-video spatial graphs captured from different camera views at different sites simultaneously. To further explicitly embed weak supervision into the DGML and solve the weakly supervised person re-id problem, we introduce weakly supervised regularization (WSR), which utilizes multiple weak video-level labels to learn discriminative features by means of a weak identity loss and a cross-video alignment loss. We conduct extensive experiments to demonstrate the feasibility of the weakly supervised person re-id approach and its special cases (e.g., its bag-to-bag extension) and show that the proposed DGML is effective.
引用
收藏
页码:6074 / 6093
页数:20
相关论文
共 50 条
  • [21] An Enhanced Metric Learning for Person Re-identification
    Lei, Zhuochen
    Yu, Xiaoqing
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 52 - 57
  • [22] Weakly Supervised Text-based Person Re-Identification
    Zhao, Shizhen
    Gao, Changxin
    Shao, Yuanjie
    Zheng, Wei-Shi
    Sang, Nong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11375 - 11384
  • [23] Weakly Supervised Distribution Discrepancy Minimization Learning With State Information for Person Re-Identification
    Kong, Jun
    Tao, Xuefeng
    Jiang, Min
    Liu, Tianshan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1903 - 1915
  • [24] Set-label modeling and deep metric learning on person re-identification
    Liu, Hao
    Ma, Bingpeng
    Qin, Lei
    Pang, Junbiao
    Zhang, Chunjie
    Huang, Qingming
    [J]. NEUROCOMPUTING, 2015, 151 : 1283 - 1292
  • [25] Person re-identification by graph-based metric fusion
    Xie, Yi
    Levine, Martin D.
    Yu, Huimin
    [J]. ELECTRONICS LETTERS, 2016, 52 (17) : 1447 - 1448
  • [26] Person Re-identification with Hierarchical Deep Learning Feature and efficient XQDA Metric
    Zeng, Mingyong
    Tian, Chang
    Wu, Zemin
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1838 - 1846
  • [27] Deep Metric Learning with Online Hard and Soft Selection for Person Re-identification
    Yu, Mingyang
    Kamata, Sei-ichiro
    [J]. 2018 JOINT 7TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2018 2ND INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2018, : 426 - 431
  • [28] Joint dictionary and metric learning for person re-identification
    Zhou, Qin
    Zheng, Shibao
    Ling, Haibin
    Su, Hang
    Wu, Shuang
    [J]. PATTERN RECOGNITION, 2017, 72 : 196 - 206
  • [29] Person re-identification based on metric learning: a survey
    Guofeng Zou
    Guixia Fu
    Xiang Peng
    Yue Liu
    Mingliang Gao
    Zheng Liu
    [J]. Multimedia Tools and Applications, 2021, 80 : 26855 - 26888
  • [30] Regularized local metric learning for person re-identification
    Liong, Venice Erin
    Lu, Jiwen
    Ge, Yongxin
    [J]. PATTERN RECOGNITION LETTERS, 2015, 68 : 288 - 296