Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification

被引:217
|
作者
Xu, Shuangjie [1 ]
Cheng, Yu [2 ]
Gu, Kang [1 ]
Yang, Yang [3 ]
Chang, Shiyu [4 ]
Zhou, Pan [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
[2] IBM Res, AI Fdn, Armonk, NY USA
[3] Northwestern Univ, Evanston, IL 60208 USA
[4] IBM TJ Watson Res Ctr, Ossining, NY 10562 USA
关键词
D O I
10.1109/ICCV.2017.507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person Re-Identification (person re-id) is a crucial task as its applications in visual surveillance and human-computer interaction. In this work, we present a novel joint Spatial and Temporal Attention Pooling Network (ASTPN) for video-based person re-identification, which enables the feature extractor to be aware of the current input video sequences, in a way that interdependency from the matching items can directly influence the computation of each other's representation. Specifically, the spatial pooling layer is able to select regions from each frame, while the attention temporal pooling performed can select informative frames over the sequence, both pooling guided by the information from distance matching. Experiments are conduced on the iLIDS-VID, PRID-2011 and MARS datasets and the results demonstrate that this approach outperforms existing state-of-art methods. We also analyze how the joint pooling in both dimensions can boost the person re-id performance more effectively than using either of them separately(1).
引用
收藏
页码:4743 / 4752
页数:10
相关论文
共 50 条
  • [1] Joint Attentive Spatial-Temporal Feature Aggregation for Video-Based Person Re-Identification
    Chen, Lin
    Yang, Hua
    Gao, Zhiyong
    [J]. IEEE ACCESS, 2019, 7 : 41230 - 41240
  • [2] Jointly Temporal Pooling Networks and Multi-loss Fusion for Video-based Person Re-Identification
    Xu, Huanhuan
    Sun, Xuemei
    [J]. PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 432 - 435
  • [3] Spatial-temporal aware network for video-based person re-identification
    Jun Wang
    Qi Zhao
    Di Jia
    Ziqing Huang
    Miaohui Zhang
    Xing Ren
    [J]. Multimedia Tools and Applications, 2024, 83 : 36355 - 36373
  • [4] Pyramid Spatial-Temporal Aggregation for Video-based Person Re-Identification
    Wang, Yingquan
    Zhang, Pingping
    Gao, Shang
    Geng, Xia
    Lu, Hu
    Wang, Dong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12006 - 12015
  • [5] Spatial-temporal aware network for video-based person re-identification
    Wang, Jun
    Zhao, Qi
    Jia, Di
    Huang, Ziqing
    Zhang, Miaohui
    Ren, Xing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36355 - 36373
  • [6] Video-based Person Re-identification with Spatial and Temporal Memory Networks
    Eom, Chanho
    Lee, Geon
    Lee, Junghyup
    Ham, Bumsub
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12016 - 12025
  • [7] Deep Spatial-Temporal Fusion Network for Video-Based Person Re-Identification
    Chen, Lin
    Yang, Hua
    Zhu, Ji
    Zhou, Qin
    Wu, Shuang
    Gao, Zhiyong
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1478 - 1485
  • [8] Video-based person re-identification with parallel spatial-temporal attention module
    Kong, Jun
    Teng, Zhende
    Jiang, Min
    Huo, Hongtao
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (01)
  • [9] Spatial-Temporal Graph Convolutional Network for Video-based Person Re-identification
    Yang, Jinrui
    Zheng, Wei-Shi
    Yang, Qize
    Chen, Ying-Cong
    Tian, Qi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3286 - 3296
  • [10] Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification
    Chen, Guangyi
    Lu, Jiwen
    Yang, Ming
    Zhou, Jie
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4192 - 4205