What-Where-When Attention Network for video-based person re-identification

被引:6
|
作者
Zhang, Chenrui [1 ,2 ]
Chen, Ping [1 ]
Lei, Tao [3 ]
Wu, Yangxu [1 ]
Meng, Hongying [4 ]
机构
[1] North Univ China, State Key Lab Elect Testing Technol, Taiyuan 030051, Peoples R China
[2] Luliang Univ, Dept Phys, Luliang 033000, Peoples R China
[3] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710021, Peoples R China
[4] Brunel Univ London, Dept Elect & Elect Engn, Uxbridge, Middx, England
基金
中国国家自然科学基金;
关键词
Person re-identification; What-Where-When Attention; Spatial-temporal feature; Graph attention network; Attribute; Identity;
D O I
10.1016/j.neucom.2021.10.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based person re-identification plays a critical role in intelligent video surveillance by learning temporal correlations from consecutive video frames. Most existing methods aim to solve the challenging variations of pose, occlusion, backgrounds and so on by using attention mechanism. They almost all draw attention to the occlusion and learn occlusion-invariant video representations by abandoning the occluded area or frames, while the other areas in these frames contain sufficient spatial information and temporal cues. To overcome these drawbacks, this paper proposes a comprehensive attention mechanism covering what, where, and when to pay attention in the discriminative spatial-temporal feature learning, namely What-Where-When Attention Network (W3AN). Concretely, W3AN designs a spatial attention module to focus on pedestrian identity and obvious attributes by the importance estimating layer (What and Where), and a temporal attention module to calculate the frame-level importance (when), which is embedded into a graph attention network to exploit temporal attention features rather than computing weighted average feature for video frames like existing methods. Moreover, the experiments on three widely-recognized datasets demonstrate the effectiveness of our proposed W3AN model and the discussion of major modules elaborates the contributions of this paper. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:33 / 47
页数:15
相关论文
共 50 条
  • [21] Diverse part attentive network for video-based person re-identification *
    Shu, Xiujun
    Li, Ge
    Wei, Longhui
    Zhong, Jia-Xing
    Zang, Xianghao
    Zhang, Shiliang
    Wang, Yaowei
    Liang, Yongsheng
    Tian, Qi
    PATTERN RECOGNITION LETTERS, 2021, 149 : 17 - 23
  • [22] A Duplex Spatiotemporal Filtering Network for Video-based Person Re-identification
    Zheng, Chong
    Wei, Ping
    Zheng, Nanning
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7551 - 7557
  • [23] Spatial Quality Aware Network for Video-Based Person Re-identification
    Wang, Yujie
    Leng, Biao
    Song, Guanglu
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 34 - 43
  • [24] Frequency Information Disentanglement Network for Video-Based Person Re-Identification
    Liu, Liangchen
    Yang, Xi
    Wang, Nannan
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4287 - 4298
  • [25] Video-based person re-identification with scene and person attributes
    Gong, Xun
    Luo, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8117 - 8128
  • [26] Video-Based Person Re-identification with Improved Temporal Attention And Spatial Memory
    Liu, Peishun
    Chen, He
    Tang, Ruichun
    Wang, Xuefang
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 448 - 454
  • [27] Video-based person re-identification with scene and person attributes
    Xun Gong
    Bin Luo
    Multimedia Tools and Applications, 2024, 83 : 8117 - 8128
  • [28] Attention-guided spatial–temporal graph relation network for video-based person re-identification
    Yu Qi
    Hongwei Ge
    Wenbin Pei
    Yuxuan Liu
    Yaqing Hou
    Liang Sun
    Neural Computing and Applications, 2023, 35 : 14227 - 14241
  • [29] Video-Based Person Re-Identification With Unregulated Sequences
    Huang, Wenjun
    Liang, Chao
    Xiao, Chunxia
    Han, Zhen
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2020, 12 (02) : 59 - 76
  • [30] Deep asymmetric video-based person re-identification
    Meng, Jingke
    Wu, Ancong
    Zheng, Wei-Shi
    PATTERN RECOGNITION, 2019, 93 : 430 - 441