Spatial-Temporal Graph Convolutional Network for Video-based Person Re-identification

被引:153
|
作者
Yang, Jinrui [1 ,3 ]
Zheng, Wei-Shi [1 ,2 ,3 ]
Yang, Qize [1 ,3 ]
Chen, Ying-Cong [4 ]
Tian, Qi [5 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518005, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Beijing, Peoples R China
[4] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[5] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR42600.2020.00335
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While video-based person re-identification (Re-ID) has drawn increasing attention and made great progress in recent years, it is still very challenging to effectively overcome the occlusion problem and the visual ambiguity problem for visually similar negative samples. On the other hand, we observe that different frames of a video can provide complementary information for each other, and the structural information of pedestrians can provide extra discriminative cues for appearance features. Thus, modeling the temporal relations of different frames and the spatial relations within a frame has the potential for solving the above problems. In this work, we propose a novel Spatial-Temporal Graph Convolutional Network (STGCN) to solve these problems. The STGCN includes two GCN branches, a spatial one and a temporal one. The spatial branch extracts structural information of a human body. The temporal branch mines discriminative cues from adjacent frames. By jointly optimizing these branches, our model extracts robust spatialtemporal information that is complementary with appearance information. As shown in the experiments, our model achieves state-of-the-art results on MARS and DukeMTMC-VideoReID datasets.
引用
收藏
页码:3286 / 3296
页数:11
相关论文
共 50 条
  • [1] Spatial-temporal aware network for video-based person re-identification
    Jun Wang
    Qi Zhao
    Di Jia
    Ziqing Huang
    Miaohui Zhang
    Xing Ren
    [J]. Multimedia Tools and Applications, 2024, 83 : 36355 - 36373
  • [2] Spatial-temporal aware network for video-based person re-identification
    Wang, Jun
    Zhao, Qi
    Jia, Di
    Huang, Ziqing
    Zhang, Miaohui
    Ren, Xing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36355 - 36373
  • [3] Deep Spatial-Temporal Fusion Network for Video-Based Person Re-Identification
    Chen, Lin
    Yang, Hua
    Zhu, Ji
    Zhou, Qin
    Wu, Shuang
    Gao, Zhiyong
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1478 - 1485
  • [4] Attention-guided spatial-temporal graph relation network for video-based person re-identification
    Qi, Yu
    Ge, Hongwei
    Pei, Wenbin
    Liu, Yuxuan
    Hou, Yaqing
    Sun, Liang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14227 - 14241
  • [5] Spatial-temporal graph-guided global attention network for video-based person re-identification
    Li, Xiaobao
    Wang, Wen
    Li, Qingyong
    Zhang, Jiang
    [J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [6] Spatial-temporal graph-guided global attention network for video-based person re-identification
    Xiaobao Li
    Wen Wang
    Qingyong Li
    Jiang Zhang
    [J]. Machine Vision and Applications, 2024, 35
  • [7] Pyramid Spatial-Temporal Aggregation for Video-based Person Re-Identification
    Wang, Yingquan
    Zhang, Pingping
    Gao, Shang
    Geng, Xia
    Lu, Hu
    Wang, Dong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12006 - 12015
  • [8] MSTN: A Multi-granular Spatial-Temporal Network for video-based person re-identification
    Zhao, Wei
    Zhang, Bo
    Yang, Cong
    Chen, Xianfu
    Chen, Hui
    [J]. INTERNET OF THINGS, 2022, 20
  • [9] Video-based person re-identification with parallel spatial-temporal attention module
    Kong, Jun
    Teng, Zhende
    Jiang, Min
    Huo, Hongtao
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (01)
  • [10] Recurrent Convolutional Network for Video-based Person Re-Identification
    McLaughlin, Niall
    del Rincon, Jesus Martinez
    Miller, Paul
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1325 - 1334