CONVOLUTIONAL TEMPORAL ATTENTION MODEL FOR VIDEO-BASED PERSON RE-IDENTIFICATION

被引:4
|
作者
Rahman, Tanzila [1 ]
Rochan, Mrigank [2 ]
Wang, Yang [2 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Univ Manitoba, Winnipeg, MB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Attention network; FCN; temporal attention; re-identification; semantic segmentation;
D O I
10.1109/ICME.2019.00193
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The goal of video-based person re-identification is to match two input videos, so that the distance of the two videos is small if two videos contain the same person. A common approach for person re-identification is to first extract image features for all frames in the video, then aggregate all the features to form a video-level feature. The video-level features of two videos can then be used to calculate the distance of the two videos. In this paper, we propose a temporal attention approach for aggregating frame-level features into a video-level feature vector for re-identification. Our method is motivated by the fact that not all frames in a video are equally informative. We propose a fully convolutional temporal attention model for generating the attention scores. Fully convolutional network (FCN) has been widely used in semantic segmentation for generating 2D output maps. In this paper, we formulate video based person re-identification as a sequence labeling problem like semantic segmentation. We establish a connection between them and modify FCN to generate attention scores to represent the importance of each frame. Extensive experiments on three different benchmark datasets (i.e. iLIDS-VID, PRID-2011 and SDU-VID) show that our proposed method outperforms other state-of-the-art approaches.
引用
收藏
页码:1102 / 1107
页数:6
相关论文
共 50 条
  • [1] Video-Based Convolutional Attention for Person Re-Identification
    Zamprogno, Marco
    Passon, Marco
    Martinel, Niki
    Serra, Giuseppe
    Lancioni, Giuseppe
    Micheloni, Christian
    Tasso, Carlo
    Foresti, Gian Luca
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 3 - 14
  • [2] TEMPORAL REGULARIZED SPATIAL ATTENTION FOR VIDEO-BASED PERSON RE-IDENTIFICATION
    Wang, Xueying
    Zhao, Xu
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2249 - 2253
  • [3] Temporal Attention Quality Aware Network for Video-based Person Re-Identification
    Xu, Boqin
    Liu, Changhong
    Xue, Shengjun
    Jiang, Aiwen
    Wang, Shimin
    Ye, Jihua
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [4] Video-Based Person Re-identification with Improved Temporal Attention And Spatial Memory
    Liu, Peishun
    Chen, He
    Tang, Ruichun
    Wang, Xuefang
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 448 - 454
  • [5] Temporal-Contextual Attention Network for Video-Based Person Re-identification
    Chen, Di
    Zha, Zheng-Jun
    Liu, Jiawei
    Xie, Hongtao
    Zhang, Yongdong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 146 - 157
  • [6] Recurrent Convolutional Network for Video-based Person Re-Identification
    McLaughlin, Niall
    del Rincon, Jesus Martinez
    Miller, Paul
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1325 - 1334
  • [7] Spatial-Temporal Graph Convolutional Network for Video-based Person Re-identification
    Yang, Jinrui
    Zheng, Wei-Shi
    Yang, Qize
    Chen, Ying-Cong
    Tian, Qi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3286 - 3296
  • [8] Triplet Attention Network for Video-Based Person Re-Identification
    Sun, Rui
    Liang, Qili
    Yang, Zi
    Zhao, Zhenghui
    Zhang, Xudong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (10) : 1775 - 1779
  • [9] Video-based person re-identification with parallel spatial-temporal attention module
    Kong, Jun
    Teng, Zhende
    Jiang, Min
    Huo, Hongtao
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (01)
  • [10] Temporal Aggregation with Clip-level Attention for Video-based Person Re-identification
    Li, Mengliu
    Xu, Han
    Wang, Jinjun
    Li, Wenpeng
    Sun, Yongli
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3365 - 3373