Spatial-temporal graph-guided global attention network for video-based person re-identification

被引:0
|
作者
Li, Xiaobao [1 ]
Wang, Wen [2 ]
Li, Qingyong [2 ]
Zhang, Jiang [3 ]
机构
[1] Jiangsu Normal Univ, Sch Comp Sci & Technol, 101 Shanghai Rd, Xuzhou 221116, Jiangsu, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, 3 Shangyuancun, Beijing 100044, Peoples R China
[3] China Acad Aerosp Aerodynam, 17 YunGang West Rd, Beijing 100074, Peoples R China
基金
中国国家自然科学基金;
关键词
Person Re-identification; Global attention learning; Graph; Spatial-temporal;
D O I
10.1007/s00138-023-01489-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Global attention learning has been extensively applied in video-based person re-identification due to its superiority in capturing contextual correlations. However, existing global attention learning methods usually adopt the conventional neural network to model non-Euclidean contextual correlations, resulting in a limited representation ability. Inspired by the graph-structure property of the contextual correlations, we propose a spatial-temporal graph-guided global attention network (STG(3)A) for video-based person re-identification. STG(3)A comprises two graph-guided attention modules to capture the spatial contexts within a frame and temporal contexts across all frames in a sequence for global attention learning. Furthermore, the graphs from both modules are encoded as graph representations, which combine with weighted representations to grasp the spatial-temporal contextual information adequately for video feature learning. To reduce the effect of noisy graph nodes and learn robust graph representations, a graph node attention is developed to trade-off the importance of each graph node, leading to noise-tolerant graph models. Finally, we design a graph-guided fusion scheme to integrate the representations output by these two attentive modules for a more compact video feature. Extensive experiments on MARS and DukeMTMCVideoReID datasets demonstrate the superior performance of the STG(3)A.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Spatial-temporal graph-guided global attention network for video-based person re-identification
    Xiaobao Li
    Wen Wang
    Qingyong Li
    Jiang Zhang
    [J]. Machine Vision and Applications, 2024, 35
  • [2] Attention-guided spatial-temporal graph relation network for video-based person re-identification
    Qi, Yu
    Ge, Hongwei
    Pei, Wenbin
    Liu, Yuxuan
    Hou, Yaqing
    Sun, Liang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14227 - 14241
  • [3] Spatial-Temporal Graph Convolutional Network for Video-based Person Re-identification
    Yang, Jinrui
    Zheng, Wei-Shi
    Yang, Qize
    Chen, Ying-Cong
    Tian, Qi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3286 - 3296
  • [4] Attention-guided spatial–temporal graph relation network for video-based person re-identification
    Yu Qi
    Hongwei Ge
    Wenbin Pei
    Yuxuan Liu
    Yaqing Hou
    Liang Sun
    [J]. Neural Computing and Applications, 2023, 35 : 14227 - 14241
  • [5] Spatial-temporal aware network for video-based person re-identification
    Jun Wang
    Qi Zhao
    Di Jia
    Ziqing Huang
    Miaohui Zhang
    Xing Ren
    [J]. Multimedia Tools and Applications, 2024, 83 : 36355 - 36373
  • [6] Spatial-temporal aware network for video-based person re-identification
    Wang, Jun
    Zhao, Qi
    Jia, Di
    Huang, Ziqing
    Zhang, Miaohui
    Ren, Xing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36355 - 36373
  • [7] Video-based person re-identification with parallel spatial-temporal attention module
    Kong, Jun
    Teng, Zhende
    Jiang, Min
    Huo, Hongtao
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (01)
  • [8] Deep Spatial-Temporal Fusion Network for Video-Based Person Re-Identification
    Chen, Lin
    Yang, Hua
    Zhu, Ji
    Zhou, Qin
    Wu, Shuang
    Gao, Zhiyong
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1478 - 1485
  • [9] Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification
    Chen, Guangyi
    Lu, Jiwen
    Yang, Ming
    Zhou, Jie
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4192 - 4205
  • [10] TEMPORAL REGULARIZED SPATIAL ATTENTION FOR VIDEO-BASED PERSON RE-IDENTIFICATION
    Wang, Xueying
    Zhao, Xu
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2249 - 2253