Scale-fusion framework for improving video-based person re-identification performance

Authors
Li Cheng
Xiao-Yuan Jing
Xiaoke Zhu
Fei Ma
Chang-Hui Hu
Ziyun Cai
Fumin Qi
Affiliations
[1] Wuhan University, School of Computer Science
[2] Guangdong University of Petrochemical Technology, School of Computer
[3] Nanjing University of Posts and Telecommunications, College of Automation
[4] Henan University, School of Computer and Information Engineering
[5] Pingdingshan University, School of Computer Science
[6] National Supercomputing Center in Shenzhen
Keywords
3D convolution; Short-term fast-varying motion information; Recurrent; Scale-fusion; Species invasion
Abstract
Video-based person re-identification (re-id), which aims to match people across videos captured by non-overlapping camera views, has attracted considerable research interest recently. In this paper, we first propose a novel hybrid 2D and 3D convolution-based recurrent neural network (HCRN) for the video-based person re-id task. Specifically, the 3D convolutional module explores local short-term fast-varying motion information, while the recurrent layer leverages global long-term spatial–temporal information. Based on HCRN, we design a scale-fusion framework that makes full use of features at different scales to further improve video-based person re-id performance. More concretely, the scale-fusion framework preserves a complete subnetwork similar to HCRN for each scale to extract features, and exchanges information between all subnetworks at several stages of the framework. In addition, we propose a training method called species invasion, which further improves the performance of HCRN and the scale-fusion framework by utilizing a large amount of unlabeled data. Experimental results on the publicly available PRID 2011, iLIDS-VID and MARS multi-shot pedestrian re-id datasets demonstrate the effectiveness of the proposed HCRN, scale-fusion framework and species invasion training method.
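The abstract does not say how the subnetworks exchange information, so the following is only a minimal sketch of one possible exchange stage, not the authors' method. It assumes each scale holds a 1-D feature vector, that features are shrunk by average pooling and enlarged by nearest-neighbour repetition, and that each scale then averages the resized features from all scales. The function names `resample` and `exchange` are hypothetical.

```python
from typing import List


def resample(feature: List[float], target_len: int) -> List[float]:
    """Resize a 1-D feature: average pooling to shrink, repetition to grow.

    Assumption for this sketch: the longer length is an exact multiple
    of the shorter one.
    """
    n = len(feature)
    if target_len == n:
        return list(feature)
    if target_len < n:
        if n % target_len != 0:
            raise ValueError("lengths must divide evenly in this sketch")
        step = n // target_len
        # Average each non-overlapping window of `step` values.
        return [sum(feature[i * step:(i + 1) * step]) / step
                for i in range(target_len)]
    if target_len % n != 0:
        raise ValueError("lengths must divide evenly in this sketch")
    factor = target_len // n
    # Repeat every value `factor` times (nearest-neighbour upsampling).
    return [value for value in feature for _ in range(factor)]


def exchange(features: List[List[float]]) -> List[List[float]]:
    """One exchange stage: each scale receives the mean of all scales,
    after every source has been resized to that scale's length."""
    fused = []
    for target in features:
        resized = [resample(source, len(target)) for source in features]
        fused.append([sum(column) / len(resized) for column in zip(*resized)])
    return fused


# Two scales: a fine one (length 4) and a coarse one (length 2).
scales = [[1.0, 2.0, 3.0, 4.0], [10.0, 20.0]]
fused = exchange(scales)
print(fused)  # [[5.5, 6.0, 11.5, 12.0], [5.75, 11.75]]
```

Each scale keeps its own resolution after the exchange, which mirrors the idea of preserving a complete subnetwork per scale; in a real network the averaging would typically be replaced by learned convolutions.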
Pages: 12841-12858
Page count: 17