CONVOLUTIONAL TEMPORAL ATTENTION MODEL FOR VIDEO-BASED PERSON RE-IDENTIFICATION

被引:4
|
作者
Rahman, Tanzila [1 ]
Rochan, Mrigank [2 ]
Wang, Yang [2 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Univ Manitoba, Winnipeg, MB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Attention network; FCN; temporal attention; re-identification; semantic segmentation;
D O I
10.1109/ICME.2019.00193
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The goal of video-based person re-identification is to match two input videos, so that the distance of the two videos is small if two videos contain the same person. A common approach for person re-identification is to first extract image features for all frames in the video, then aggregate all the features to form a video-level feature. The video-level features of two videos can then be used to calculate the distance of the two videos. In this paper, we propose a temporal attention approach for aggregating frame-level features into a video-level feature vector for re-identification. Our method is motivated by the fact that not all frames in a video are equally informative. We propose a fully convolutional temporal attention model for generating the attention scores. Fully convolutional network (FCN) has been widely used in semantic segmentation for generating 2D output maps. In this paper, we formulate video based person re-identification as a sequence labeling problem like semantic segmentation. We establish a connection between them and modify FCN to generate attention scores to represent the importance of each frame. Extensive experiments on three different benchmark datasets (i.e. iLIDS-VID, PRID-2011 and SDU-VID) show that our proposed method outperforms other state-of-the-art approaches.
引用
收藏
页码:1102 / 1107
页数:6
相关论文
共 50 条
  • [21] Learning Bidirectional Temporal Cues for Video-Based Person Re-Identification
    Zhang, Wei
    Yu, Xiaodong
    He, Xuanyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2768 - 2776
  • [22] Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification
    Liu, Yiheng
    Yuan, Zhenxun
    Zhou, Wengang
    Li, Houqiang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8786 - 8793
  • [23] Video-based Person Re-identification with Spatial and Temporal Memory Networks
    Eom, Chanho
    Lee, Geon
    Lee, Junghyup
    Ham, Bumsub
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12016 - 12025
  • [24] Attention-guided spatial–temporal graph relation network for video-based person re-identification
    Yu Qi
    Hongwei Ge
    Wenbin Pei
    Yuxuan Liu
    Yaqing Hou
    Liang Sun
    Neural Computing and Applications, 2023, 35 : 14227 - 14241
  • [25] Video-based person re-identification with scene and person attributes
    Gong, Xun
    Luo, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8117 - 8128
  • [26] STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification
    Fu, Yang
    Wang, Xiaoyang
    Wei, Yunchao
    Huang, Thomas
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8287 - 8294
  • [27] Parallel Attention with Weighted Efficient Network for Video-Based Person Re-Identification
    Yang, Junting
    Yang, Zuliu
    Zhou, Jing
    Zhao, Yong
    Dai, Qifei
    Li, Fuchi
    2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 133 - 139
  • [28] An Efficient Axial-Attention Network for Video-Based Person Re-Identification
    Zhang, Fuping
    Zhang, Tianzhao
    Sun, Ruoxi
    Huang, Chao
    Wei, Jianming
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1352 - 1356
  • [29] Video-based person re-identification with scene and person attributes
    Xun Gong
    Bin Luo
    Multimedia Tools and Applications, 2024, 83 : 8117 - 8128
  • [30] Multi-stage attention network for video-based person re-identification
    Yang, Fan
    Li, Wei
    Liang, Binbin
    Han, Songchen
    Zhu, Xuan
    IET COMPUTER VISION, 2022, 16 (05) : 445 - 455